Procaryotic carbonyl hydrolases, methods, DNA, vectors and transformed hosts for producing them, and detergent
compositions containing them.

Methods and vectors are provided for the production of
procaryotic carbonyl hydrolases in recombinant systems. DNA
which encodes such hydrolases is mutated at predetermined
regions by known methods or by a novel point mutagenesis
method in order to generate mutant hydrolases. Particular
point mutations in carbonyl hydrolases such as subtilisin result
in modifications of oxidation stability, Km, Kcat, Kcat/Km ratio,
substrate specificity, specific activity or pH-activity profiles.
These mutated hydrolases are particularly useful in laundry
compositions. Mutations in the genes encoding the subtilisin or
neutral protease of Bacillus yield substantially normally
sporulating Bacillus strains which are incapable of excreting
subtilisin or neutral protease. Such strains are useful in the
recombinant synthesis of heterologous proteins.




0130756 




PROCARYOTIC CARBONYL HYDROLASES, METHODS, DNA,
VECTORS AND TRANSFORMED HOSTS FOR PRODUCING
THEM, AND DETERGENT COMPOSITIONS CONTAINING THEM



Background 

This invention relates to the production and manipulation of 
proteins using recombinant techniques in suitable hosts. More 
specifically, the invention relates to the production of procaryotic 
proteases such as subtilisin and neutral protease using recombinant 
microbial host cells, to the synthesis of heterologous proteins by 
microbial hosts, and to the directed mutagenesis of enzymes in order 
to modify the characteristics thereof. 

Various bacteria are known to secrete proteases at some stage in
their life cycles. Bacillus species produce two major extracellular
proteases, a neutral protease (a metalloprotease inhibited by EDTA)
and an alkaline protease (or subtilisin, a serine endoprotease).
Both generally are produced in greatest quantity after the
exponential growth phase, when the culture enters stationary phase
and begins the process of sporulation. The physiological role of
these two proteases is not clear. They have been postulated to play
a role in sporulation (J. Hoch, 1976, "Adv. Genet." 18:69-98;
P. Piggot et al., 1976, "Bact. Rev." 40:908-962; and F. Priest,
1977, "Bact. Rev." 41:711-753), to be involved in the regulation of
cell wall turnover (L. Jolliffe et al., 1980, "J. Bact."
141:1199-1208), and to be scavenger enzymes (Priest, Id.). The
regulation of expression of the protease genes is complex. They
appear to be coordinately regulated in concert with sporulation,
since mutants blocked in the early stages of sporulation exhibit
reduced levels of both the alkaline and neutral protease.
Additionally, a number of pleiotropic mutations exist which affect
the level of expression of proteases and other secreted gene
products, such as amylase and levansucrase (Priest, Id.).

Subtilisin has found considerable utility in industrial and
commercial applications (see U.S. Patent No. 3,623,957 and
J. Millet, 1970, "J. Appl. Bact." 33:207). For example, subtilisins
and other proteases are commonly used in detergents to enable
removal of protein-based stains. They also are used in food
processing to accommodate the proteinaceous substances present in
the food preparations to their desired impact on the composition.

Classical mutagenesis of bacteria with agents such as radiation
or chemicals has produced a plethora of mutant strains exhibiting
different properties with respect to the growth phase at which
protease excretion occurs as well as the timing and activity levels
of excreted protease. These strains, however, do not approach the
ultimate potential of the organisms because the mutagenic process is
essentially random, with tedious selection and screening required to
identify organisms which even approach the desired characteristics.
Further, these mutants are capable of reversion to the parent or
wild-type strain. In such event the desirable property is lost.
The probability of reversion is unknown when dealing with random
mutagenesis since the type and site of mutation are unknown or poorly
characterized. This introduces considerable uncertainty into the
industrial process which is based on the enzyme-synthesizing
bacterium. Finally, classical mutagenesis frequently couples a
desirable phenotype, e.g., low protease levels, with an undesirable
character such as excessive premature cell lysis.

Special problems exist with respect to the proteases which are
excreted by Bacillus. For one thing, since at least two such
proteases exist, screening for the loss of only one is difficult.
Additionally, the large number of pleiotropic mutations affecting
both sporulation and protease production makes the isolation of true
protease mutations difficult.

Temperature sensitive mutants of the neutral protease gene have
been obtained by conventional mutagenic techniques, and were used to
map the position of the regulatory and structural gene in the
Bacillus subtilis chromosome (H. Uehara et al., 1979, "J. Bact."
139:583-590). Additionally, a presumed nonsense mutation of the
alkaline protease gene has been reported (C. Roitsch et al., 1983,
"J. Bact." 155:145-152).

Bacillus temperature sensitive mutants have been isolated that
produce inactive serine protease or greatly reduced levels of serine
protease. These mutants, however, are asporogenous and show a
reversion frequency to the wild-type of about from 10^-7 to 10^-8
(F. Priest, Id. p. 719). These mutants are unsatisfactory for the
recombinant production of heterologous proteins because asporogenous
mutants tend to lyse during earlier stages of their growth cycle in
minimal medium than do sporogenic mutants, thereby prematurely
releasing cellular contents (including intracellular proteases) into
the culture supernatant. The possibility of reversion also is
undesirable since wild-type revertants will contaminate the culture
supernatant with excreted proteases.

Bacillus sp. have been proposed for the expression of
heterologous proteins, but the presence of excreted proteases and
the potential resulting hydrolysis of the desired product has
retarded the commercial acceptance of Bacillus as a host for the
expression of heterologous proteins. Bacillus megaterium mutants
have been disclosed that are capable of sporulation and which do not
express a sporulation-associated protease during growth phases.
However, the assay employed did not exclude the presence of other
proteases, and the protease in question is expressed during the
sporulation phase (C. Loshon et al., 1982, "J. Bact." 150:303-311).
This, of course, is the point at which heterologous protein would
have accumulated in the culture and be vulnerable. It is an
objective herein to construct a Bacillus strain that is
substantially free of extracellular neutral and alkaline protease
during all phases of its growth cycle and which exhibits
substantially normal sporulation characteristics. A need exists for
non-revertible, otherwise normal protease deficient organisms that
can then be transformed with high copy number plasmids for the
expression of heterologous or homologous proteins.

Enzymes having characteristics which vary from available stock
are required. In particular, enzymes having enhanced oxidation
stability will be useful in extending the shelf life and bleach
compatibility of proteases used in laundry products. Similarly,
reduced oxidation stability would be useful in industrial processes
that require the rapid and efficient quenching of enzymatic activity.

Modifying the pH-activity profiles of an enzyme would be useful
in making the enzymes more efficient in a wide variety of processes,
e.g. broadening the pH-activity profile of a protease would produce
an enzyme more suitable for both alkaline and neutral laundry
products. Narrowing the profile, particularly when combined with
tailored substrate specificity, would make enzymes in a mixture more
compatible, as will be further described herein.

Mutations of procaryotic carbonyl hydrolases (principally
proteases but including lipases) will facilitate preparation of a
variety of different hydrolases, particularly those having other
modified properties such as Km, Kcat, Kcat/Km ratio and substrate
specificity. These enzymes can then be tailored for the particular
substrate which is anticipated to be present, for example in the
preparation of peptides or for hydrolytic processes such as laundry
uses.

Chemical modification of enzymes is known. For example, see I.
Svendsen, 1976, "Carlsberg Res. Commun." 41(5):237-291. These
methods, however, suffer from the disadvantages of being dependent
upon the presence of convenient amino acid residues, are frequently
nonspecific in that they modify all accessible residues with common
side chains, and are not capable of reaching inaccessible amino acid
residues without further processing, e.g. denaturation, that is
generally not completely reversible in reinstituting activity. To
the extent that such methods have the objective of replacing one
amino acid residue side chain for another side chain or equivalent
functionality, then mutagenesis promises to supplant such methods.

Predetermined, site-directed mutagenesis of tRNA synthetase in
which a cys residue is converted to serine has been reported
(G. Winter et al., 1982, "Nature" 299:756-758; A. Wilkinson et al.,
1984, "Nature" 307:187-188). This method is not practical for large
scale mutagenesis. It is an object herein to provide a convenient
and rapid method for mutating DNA by saturation mutagenesis.

Summary

A method for producing procaryotic carbonyl hydrolase such as
subtilisin and neutral protease in recombinant host cells is
described in which expression vectors containing sequences which
encode desired subtilisin or neutral protease, including the pro,
pre, or prepro forms of these enzymes, are used to transform hosts,
the host cultured and desired enzymes recovered. The coding
sequence may correspond exactly to one found in nature, or may
contain modifications which confer desirable properties on the
protein that is produced, as is further described below.

The novel strains then are transformed with at least one DNA
moiety encoding a polypeptide not otherwise expressed in the host
strain, the transformed strains cultured and the polypeptide
recovered from the culture. Ordinarily, the DNA moiety is a
directed mutant of a host Bacillus gene, although it may be DNA
encoding a eucaryotic (yeast or mammalian) protein. The novel
strains also serve as hosts for protein expressed from a bacterial
gene derived from sources other than the host genome, or for vectors
expressing these heterologous genes, or homologous genes from the
host genome. In the latter event enzymes such as amylase are
obtained free of neutral protease or subtilisin. In addition, it is
now possible to obtain neutral protease in culture which is free of
enzymatically active subtilisin, and vice-versa.

One may, by splicing the cloned genes for procaryotic carbonyl 
hydrolase into a high copy number plasmid, synthesize the enzymes in 
enhanced yield compared to the parental organisms. Also disclosed
are modified forms of such hydrolases, including the pro and prepro 
zymogen forms of the enzymes, the pre forms, and directed mutations 
thereof. 

A convenient method is provided for saturation mutagenesis,
thereby enabling the rapid and efficient generation of a plurality
of mutations at any one site within the coding region of a protein,
comprising:

(a) obtaining a DNA moiety encoding at least a portion of
said precursor protein;

(b) identifying a region within the moiety;

(c) substituting nucleotides for those already existing
within the region in order to create at least one
restriction enzyme site unique to the moiety, whereby unique
restriction sites 5' and 3' to the identified region are
made available such that neither alters the amino acids
coded for by the region as expressed;

(d) synthesizing a plurality of oligonucleotides, the 5' and
3' ends of which each contain sequences capable of annealing
to the restriction enzyme sites introduced in step (c) and
which, when ligated to the moiety, are expressed as
substitutions, deletions and/or insertions of at least one
amino acid in or into said precursor protein;

(e) digesting the moiety of step (c) with restriction
enzymes capable of cleaving the unique sites; and

(f) ligating each of the oligonucleotides of step (d) into
the digested moiety of step (e) whereby a plurality of
mutant DNA moieties are obtained.
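
By way of illustration only, steps (d) through (f) can be sketched in Python as the enumeration of a cassette pool that places each of the 20 amino acids at one target codon between fixed flanking sequences. The flank sequences, the choice of one codon per amino acid and the name `cassette_pool` are hypothetical and are not taken from this specification; restriction-site overhangs and ligation chemistry are ignored.

```python
# One codon per amino acid; the particular codon choices are illustrative only.
CODON_FOR = {
    "A": "GCT", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGT",
    "Q": "CAG", "E": "GAA", "G": "GGT", "H": "CAT", "I": "ATC",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTC", "P": "CCG",
    "S": "TCT", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTT",
}

def cassette_pool(left_flank, right_flank):
    """Return {amino_acid: cassette} with every residue at the target codon.

    left_flank and right_flank stand for the unchanged coding nucleotides
    lying between the unique 5' and 3' restriction sites and the target codon.
    """
    return {aa: left_flank + codon + right_flank
            for aa, codon in CODON_FOR.items()}

# Hypothetical flanks, chosen only to make the example concrete.
pool = cassette_pool("GCTTCT", "GGTACC")
for aa in sorted(pool):
    print(aa, pool[aa])
```

Each entry of the pool corresponds to one oligonucleotide of step (d); ligating every entry into the same digested moiety yields the plurality of mutant DNA moieties of step (f).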

By the foregoing method or others known in the art, a mutation
is introduced into isolated DNA encoding a procaryotic carbonyl
hydrolase which, upon expression of the DNA, results in the
substitution, deletion or insertion of at least one amino acid at a
predetermined site in the hydrolase. This method is useful in
creating mutants of wild type proteins (where the "precursor"
protein is the wild type) or reverting mutants to the wild type
(where the "precursor" is the mutant).

Mutant enzymes are recovered which exhibit oxidative stability
and/or pH-activity profiles which differ from the precursor
enzymes. Procaryotic carbonyl hydrolases having varied Km, Kcat,
Kcat/Km ratio and substrate specificity also are provided herein.

The mutant enzymes obtained by the methods herein are combined
in known fashion with surfactants or detergents to produce novel
compositions useful in the laundry or other cleaning arts.

Brief Description of the Drawings 

Figure 1 shows the sequence of a functional B. amyloliquefaciens
subtilisin gene.

In Figure 1A, the entire functional sequence for B.
amyloliquefaciens, including the promoter and ribosome binding site,
is present on a 1.5 kb fragment of the B. amyloliquefaciens genome.

Figure 1B shows the nucleotide sequence of the coding strand,
correlated with the amino acid sequence of the protein. Promoter
(p), ribosome binding site (rbs) and termination (term) regions of
the DNA sequence are also shown.

Figure 2 shows the results of replica nitrocellulose filters of
purified positive clones probed with Pool 1 (Panel A) and Pool 2
(Panel B) respectively.

Figure 3 shows the restriction analysis of the subtilisin
expression plasmid (pS4). pBS42 vector sequences (4.5 kb) are shown
in solid while the insert sequence (4.4 kb) is shown dashed.

Figure 4 shows the results of SDS-PAGE performed on supernatants
from cultures transformed with pBS42 and pS4.

Figure 5 shows the construction of the shuttle vector pBS42.

Figure 6 shows a restriction map for a sequence including the B.
subtilis subtilisin gene.

Figure 7 is the sequence of a functional B. subtilis subtilisin
gene.

Figure 8 demonstrates a construction method for obtaining a
deletion mutant of a B. subtilis subtilisin gene.

Figure 9 discloses the restriction map for a B. subtilis neutral
protease gene.

Figure 10 is the nucleotide sequence for a B. subtilis neutral
protease gene.

Figure 11 demonstrates the construction of a vector containing a
B. subtilis neutral protease gene.

Figures 12, 13 and 16 disclose embodiments of the mutagenesis
technique provided herein.

Figure 14 shows the enhanced oxidation stability of a subtilisin
mutant.

Figure 15 demonstrates a change in the pH-activity profile of a
subtilisin mutant when compared to the wild type enzyme.

Detailed Description 

Procaryotic carbonyl hydrolases are enzymes which hydrolyze
compounds containing C(=O)-X bonds in which X is oxygen or nitrogen.
They principally include hydrolases, e.g. lipases, and peptide
hydrolases, e.g. subtilisins or metalloproteases. Peptide
hydrolases include α-aminoacyl peptide hydrolase, peptidylamino-acid
hydrolase, acylamino hydrolase, serine carboxypeptidase,
metallocarboxypeptidase, thiol proteinase, carboxyl proteinase and
metalloproteinase. Serine, metallo, thiol and acid proteases are
included, as well as endo and exo-proteases.

Subtilisins are serine proteinases which generally act to cleave
internal peptide bonds of proteins or peptides. Metalloproteases
are exo- or endoproteases which require a metal ion cofactor for
activity.

A number of naturally occurring mutants of subtilisin or neutral 
protease exist, and all may be employed with equal effect herein as 
sources for starting genetic material. 



These enzymes and their genes may be obtained from many
procaryotic organisms. Suitable examples include gram negative
organisms such as E. coli or Pseudomonas and gram positive bacteria
such as Micrococcus or Bacillus.

The genes encoding the carbonyl hydrolase may be obtained in
accord with the general method herein. As will be seen from the
examples, this comprises synthesizing labelled probes having
putative sequences encoding regions of the hydrolase of interest,
preparing genomic libraries from organisms expressing the
hydrolase, and screening the libraries for the gene of interest by
hybridization to the probes. Positively hybridizing clones are then
mapped and sequenced. The cloned genes are ligated into an
expression vector (which also may be the cloning vector) with
requisite regions for replication in the host, the plasmid
transfected into a host for enzyme synthesis and the recombinant
host cells cultured under conditions favoring enzyme synthesis,
usually selection pressure such as is supplied by the presence of an
antibiotic, the resistance to which is encoded by the vector.
Culture under these conditions results in enzyme yields multifold
greater than the wild type enzyme synthesis of the parent organism,
even if it is the parent organism that is transformed.

"Expression vector" refers to a DNA construct containing a DNA
sequence which is operably linked to a suitable control sequence
capable of effecting the expression of said DNA in a suitable host.
Such control sequences include a promoter to effect transcription,
an optional operator sequence to control such transcription, a
sequence encoding suitable mRNA ribosome binding sites, and
sequences which control termination of transcription and
translation. The vector may be a plasmid, a phage particle, or
simply a potential genomic insert. Once transformed into a suitable
host, the vector may replicate and function independently of the
host genome, or may, in some instances, integrate into the genome
itself. In the present specification, "plasmid" and "vector" are
sometimes used interchangeably as the plasmid is the most commonly
used form of vector at present. However, the invention
is intended to include such other forms of expression vectors which
serve equivalent functions and which are, or become, known in the
art.

"Recombinant host cells" refers to cells which have been 
transformed or transfected with vectors constructed using recombinant
DNA techniques. As relevant to the present invention, recombinant
host cells are those which produce procaryotic carbonyl hydrolases
in their various forms by virtue of having been transformed with
expression vectors encoding these proteins. The recombinant host
cells may or may not have produced a form of carbonyl hydrolase
prior to transformation.

"Operably linked" when describing the relationship between two 
DNA regions simply means that they are functionally related to each
other. For example, a presequence is operably linked to a peptide
if it functions as a signal sequence, participating in the secretion
of the mature form of the protein most probably involving cleavage
of the signal sequence. A promoter is operably linked to a coding
sequence if it controls the transcription of the sequence; a
ribosome binding site is operably linked to a coding sequence if it
is positioned so as to permit translation.

"Prohydrolase" refers to a hydrolase which contains additional
N-terminal amino acid residues which render the enzyme inactive but,
when removed, yield an enzyme. Many proteolytic enzymes are found
in nature as translational proenzyme products and, in the absence of
post-translational products, are expressed in this fashion.

"Presequence" refers to a signal sequence of amino acids bound 
to the N-terminal portion of the hydrolase which may participate in 
the secretion of the hydrolase. Presequences also may be modified
in the same fashion as is described here, including the introduction
of predetermined mutations. When bound to a hydrolase, the subject
protein becomes a "prehydrolase". Accordingly, relevant
prehydrolases for the purposes herein are presubtilisin and
preprosubtilisin. Prehydrolases are produced by deleting the "pro"
sequence (or at least that portion of the pro sequence that
maintains the enzyme in its inactive state) from a prepro coding
region, and then expressing the prehydrolase. In this way the
organism excretes the active enzyme rather than the proenzyme.

The cloned carbonyl hydrolase is used to transform a host cell
in order to express the hydrolase. This will be of interest where
the hydrolase has commercial use in its unmodified form, as for
example subtilisin in laundry products as noted above. In the
preferred embodiment the hydrolase gene is ligated into a high copy
number plasmid. This plasmid replicates in hosts in the sense that
it contains the well-known elements necessary for plasmid
replication: a promoter operably linked to the gene in question
(which may be supplied as the gene's own homologous promoter if it
is recognized, i.e., transcribed, by the host), a transcription
termination and polyadenylation region (necessary for stability of
the mRNA transcribed by the host from the hydrolase gene) which is
exogenous or is supplied by the endogenous terminator region of the
hydrolase gene and, desirably, a selection gene such as an
antibiotic resistance gene that enables continuous cultural
maintenance of plasmid-infected host cells by growth in
antibiotic-containing media. High copy number plasmids also contain
an origin of replication for the host, thereby enabling large
numbers of plasmids to be generated in the cytoplasm without
chromosomal limitations. However, it is within the scope herein to
integrate multiple copies of the hydrolase gene into the host genome.
This is facilitated by bacterial strains which are particularly
susceptible to homologous recombination. The resulting host cells
are termed recombinant host cells.

Once the carbonyl hydrolase gene has been cloned, a number of
modifications are undertaken to enhance the use of the gene beyond
synthesis of the wild type or precursor enzyme. A precursor enzyme
is the enzyme prior to its modification as described in this
application. Usually the precursor is the enzyme as expressed by
the organism which donated the DNA modified in accord herewith. The
term "precursor" is to be understood as not implying that the
product enzyme was the result of manipulation of the precursor
enzyme per se.

In the first of these modifications, the gene may be deleted
from a recombination positive (rec+) organism containing a
homologous gene. This is accomplished by recombination of an in
vitro deletion mutation of the cloned gene with the genome of the
organism. Many strains of organisms such as E. coli and Bacillus are
known to be capable of recombination. All that is needed is for
regions of the residual DNA from the deletion mutant to recombine
with homologous regions of the candidate host. The deletion may be
within the coding region (leaving enzymatically inactive
polypeptides) or include the entire coding region as long as
homologous flanking regions (such as promoters or termination
regions) exist in the host. Acceptability of the host for
recombination deletion mutants is simply determined by screening for
the deletion of the transformed phenotype. This is most readily
accomplished in the case of carbonyl hydrolase by assaying host
cultures for loss of the ability to cleave a chromogenic substrate
otherwise hydrolyzed by the hydrolase.

Transformed hosts containing the protease deletion mutants are
useful for synthesis of products which are incompatible with
proteolytic enzymes. These hosts by definition are incapable of
excreting the deleted proteases described herein, yet are
substantially normally sporulating. Also the other growth
characteristics of the transformants are substantially like the
parental organism. Such organisms are useful in that it is expected
they will exhibit comparatively less inactivation of heterologous
proteins than the parents, and these hosts do have growth
characteristics superior to known protease-deficient organisms.
However, the deletion of neutral protease and subtilisin as
described in this application does not remove all of the proteolytic
activity of Bacillus. It is believed that intracellular proteases
which are not ordinarily excreted extracellularly "leak" or diffuse
from the cells during late phases of the culture. These
intracellular proteases may or may not be subtilisin or neutral
protease as those enzymes are defined herein. Accordingly, the
novel Bacillus strains herein are incapable of excreting the
subtilisin and/or neutral protease enzymes which ordinarily are
excreted extracellularly in the parent strains. "Incapable" means
not revertible to the wild type. Reversion is a finite probability
that exists with the heretofore known protease-deficient, naturally
occurring strains since there is no assurance that the phenotype of
such strains is not a function of a readily revertible mutation,
e.g. a point mutation. This is to be contrasted with the extremely
large deletions provided herein.

The deletion mutant-transformed host cells herein are free of
genes encoding enzymatically active neutral protease or subtilisin,
which genes are defined as those being substantially homologous with
the genes set forth in Figs. 1, 7 or 10. "Homologous" genes contain
coding regions capable of hybridizing under high stringency
conditions with the genes shown in Figs. 1, 7 or 10.

The microbial strains containing carbonyl hydrolase deletion
mutants are useful in two principal processes. In one embodiment
they are advantageous in the fermentative production of products
ordinarily expressed by a host that are desirably uncontaminated
with the protein encoded by the deletion gene. An example is
fermentative synthesis of amylase, where contaminant proteases
interfere in many industrial uses for amylase. The novel strains
herein relieve the art from part of the burden of purifying such
products free of contaminating carbonyl hydrolases.

In a second principal embodiment, subtilisin and neutral
protease deletion-mutant strains are useful in the synthesis of
protein which is not otherwise encoded by the strain. These
proteins will fall within one of two classes. The first class
consists of proteins encoded by genes exhibiting no substantial
pretransformation homology with those of the host. These may be
proteins from other procaryotes but ordinarily are eucaryotic
proteins from yeast or higher eucaryotic organisms, particularly
mammals. The novel strains herein serve as useful hosts for
expressible vectors containing genes encoding such proteins because
the probability for proteolytic degradation of the expressed,
non-homologous proteins is reduced.

The second group consists of mutant host genes exhibiting
substantial pretransformation homology with those of the host.
These include mutations of procaryotic carbonyl hydrolases such as
subtilisin and neutral protease, as well as microbial rennin (for
example, rennin from the genus Mucor). These mutants are selected in
order to improve the characteristics of the precursor enzyme for
industrial uses.

A novel method is provided to facilitate the construction and
identification of such mutants. First, the gene encoding the
hydrolase is obtained and sequenced in whole or in part. Then the
sequence is scanned for a point at which it is desired to make a
mutation (deletion, insertion or substitution) of one or more amino
acids in the expressed enzyme. The sequences flanking this point
are evaluated for the presence of restriction sites for replacing a
short segment of the gene with an oligonucleotide pool which when
expressed will encode various mutants. Since unique restriction
sites are generally not present at locations within a convenient
distance from the selected point (from 10 to 15 nucleotides), such
sites are generated by substituting nucleotides in the gene in such
a fashion that neither the reading frame nor the amino acids encoded
are changed in the final construction. The task of locating
suitable flanking regions and evaluating the needed changes to
arrive at two unique restriction site sequences is made routine by
the redundancy of the genetic code, a restriction enzyme map of the
gene and the large number of different restriction enzymes. Note
that if a fortuitous flanking unique restriction site is available,
the above method need be used only in connection with the flanking
region which does not contain a site.

Mutation of the gene in order to change its sequence to conform
to the desired sequence is accomplished by M13 primer extension in
accord with generally known methods. Once the gene is cloned, it is
digested with the unique restriction enzymes and a plurality of end
termini-complementary oligonucleotide cassettes are ligated into the
unique sites. The mutagenesis is enormously simplified by this
method because all of the oligonucleotides can be synthesized so as
to have the same restriction sites, and no synthetic linkers are
necessary to create the restriction sites.
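
The generation of a restriction site by silent nucleotide substitution, as described above, might be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used: the standard genetic code is assumed, the 12-nucleotide input is a made-up in-frame stretch, and the names `translate` and `silent_site_options` are hypothetical. Uniqueness of the site within the whole gene and vector would need to be checked separately.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def silent_site_options(coding_seq, site):
    """List (offset, n_changes, new_seq) for every silent way to write `site`."""
    protein = translate(coding_seq)
    options = []
    for i in range(len(coding_seq) - len(site) + 1):
        if coding_seq[i:i + len(site)] == site:
            continue  # site already present; nothing to engineer
        new_seq = coding_seq[:i] + site + coding_seq[i + len(site):]
        if translate(new_seq) == protein:
            n_changes = sum(a != b for a, b in zip(coding_seq, new_seq))
            options.append((i, n_changes, new_seq))
    return options

# Hypothetical stretch encoding Ala-Pro-Gly-Ser; search for a SmaI site.
options = silent_site_options("GCACCAGGTTCT", "CCCGGG")
for offset, n_changes, new_seq in options:
    print(offset, n_changes, new_seq)
```

Every returned sequence encodes the same amino acids as the input, so any of them could serve as the template for the primer extension step; the number of changed nucleotides gives a rough measure of how much of the gene must be altered.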

The number of commercially available restriction enzymes having
sites not present in the gene of interest is generally large. A
suitable DNA sequence computer search program simplifies the task of
finding potential 5' and 3' unique flanking sites. A primary
constraint is that any mutation introduced in creation of the
restriction site must be silent to the final constructed amino acid
coding sequence. For a candidate restriction site 5' to the target
codon a sequence must exist in the gene which contains at least all
the nucleotides but for one in the recognition sequence 5' to the
cut of the candidate enzyme. For example, the blunt cutting enzyme
SmaI (CCC/GGG) would be a 5' candidate if a nearby 5' sequence
contained NCC, CNC, or CCN. Furthermore, if N needed to be altered
to C this alteration must leave the amino acid coding sequence
intact. In cases where a permanent silent mutation is necessary to
introduce a restriction site one may want to avoid the introduction
of a rarely used codon. A similar situation for SmaI would apply
for 3' flanking sites except the sequence NGG, GNG, or GGN must
exist. The criteria for locating candidate enzymes are most relaxed
for blunt cutting enzymes and most stringent for 4 base overhang
enzymes. In general many candidate sites are available. For the
codon-222 target described herein a BalI site (TGG/CCA) could have
been engineered in one base pair 5' from the KpnI site. A 3' EcoRV
site (GAT/ATC) could have been employed 11 base pairs 5' to the PstI
site. A cassette having termini ranging from a blunt end up to a
four base-overhang will function without difficulty. In retrospect,
this hypothetical EcoRV site would have significantly shortened the
oligonucleotide cassette employed (9 and 13 base pairs) thus
allowing greater purity and lower pool bias problems. Flanking
sites should obviously be chosen which cannot themselves ligate so
that ligation of the oligonucleotide cassette can be assured in a
single orientation.
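
The search for candidate flanking sites can be sketched in a few lines
of Python. The fragment below is illustrative only and is not part of
the disclosed method. It scans an in-frame coding sequence for windows
that differ from a half recognition sequence (e.g. the CCC half of
SmaI) at exactly one nucleotide, and keeps those windows where
substituting the half-site leaves the encoded amino acids unchanged.
The function names, the example sequence and the assumption that the
input begins at a codon boundary are hypothetical.

```python
# Illustrative sketch: find positions where a restriction half-site can be
# created by one silent nucleotide change. All names here are hypothetical.

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Standard genetic code, built in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate an in-frame DNA sequence, ignoring a trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def silent_half_sites(gene, half_site):
    """Return (index, original_window) pairs where the window differs from
    half_site at exactly one base and the substitution is silent."""
    gene = gene.upper()
    half_site = half_site.upper()
    k = len(half_site)
    protein = translate(gene)
    hits = []
    for i in range(len(gene) - k + 1):
        window = gene[i:i + k]
        mismatches = sum(1 for x, y in zip(window, half_site) if x != y)
        if mismatches != 1:
            continue
        mutated = gene[:i] + half_site + gene[i + k:]
        if translate(mutated) == protein:
            hits.append((i, window))
    return hits
```

For the made-up sequence ATGGCACCGTAA, the call
silent_half_sites("ATGGCACCGTAA", "CCC") reports the windows CAC, ACC
and CCG (the NCC, CNC and CCN cases of the text). A fuller search would
also check codon usage and the uniqueness of the resulting site.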

The mutation per se need not be predetermined. For example, an
oligonucleotide cassette or fragment is randomly mutagenized with
nitrosoguanidine or other mutagen and then in turn ligated into the
hydrolase gene at a predetermined location.

The mutant carbonyl hydrolases expressed upon transformation of
the suitable hosts are screened for enzymes exhibiting desired
characteristics, e.g. substrate specificity, oxidation stability,
pH-activity profiles and the like.

A change in substrate specificity is defined as a difference
between the Kcat/Km ratio of the precursor enzyme and that of the
mutant. The Kcat/Km ratio is a measure of catalytic efficiency.
Procaryotic carbonyl hydrolases with increased or diminished Kcat/Km
ratios are described in the examples. Generally, the objective will
be to secure a mutant having a greater (numerically larger) Kcat/Km
ratio for a given substrate, thereby enabling the use of the enzyme
to more efficiently act on a target substrate. An increase in
Kcat/Km ratio for one substrate may be accompanied by a reduction
in Kcat/Km ratio for another substrate. This is a shift in
substrate specificity, and mutants exhibiting such shifts have
utility where the precursors are undesirable, e.g. to prevent
undesired hydrolysis of a particular substrate in an admixture of
substrates.
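
The comparison of Kcat/Km ratios can be illustrated with a short
Python sketch. It is offered only as an illustration. The function
names and the dictionary layout (substrate name mapped to a
(Kcat, Km) pair) are assumptions, and any numbers supplied to it are
the user's own measurements, not values from this disclosure.

```python
# Illustrative sketch: fold change in catalytic efficiency (Kcat/Km) of a
# mutant relative to its precursor, per substrate. Names are hypothetical.

def catalytic_efficiency(kcat, km):
    """Return Kcat/Km; Km must be positive."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km


def specificity_shift(precursor, mutant):
    """Map each substrate present in both inputs to
    (mutant Kcat/Km) / (precursor Kcat/Km)."""
    shift = {}
    for substrate, (kcat, km) in precursor.items():
        if substrate in mutant:
            m_kcat, m_km = mutant[substrate]
            shift[substrate] = (catalytic_efficiency(m_kcat, m_km)
                                / catalytic_efficiency(kcat, km))
    return shift
```

A fold change above 1 for one substrate together with a fold change
below 1 for another corresponds to the shift in substrate specificity
described above.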

Kcat and Km are measured in accord with known procedures, or as
described in Example 18.

Oxidation stability is a further objective which is accomplished
by mutants described in the examples. The stability may be enhanced
or diminished as is desired for various uses. Enhanced stability is
effected by deleting one or more methionine, tryptophan, cysteine or
lysine residues and, optionally, substituting another amino acid
residue not one of methionine, tryptophan, cysteine or lysine. The
opposite substitutions result in diminished oxidation stability.
The substituted residue is preferably alanyl, but neutral residues
also are suitable.

Mutants are provided which exhibit modified pH-activity
profiles. A pH-activity profile is a plot of pH against enzyme
activity and may be constructed as illustrated in Example 19 or by
methods known in the art. It may be desired to obtain mutants with
broader profiles, i.e., those having greater activity at certain pH
than the precursor, but no significantly greater activity at any pH,
or mutants with sharper profiles, i.e. those having enhanced
activity when compared to the precursor at a given pH, and lesser
activity elsewhere.

The foregoing mutants preferably are made within the active site
of the enzyme as these mutations are most likely to influence
activity. However, mutants at other sites important for enzyme
stability or conformation are useful. In the case of Bacillus
subtilisin or its pre, prepro and pro forms, mutations at tyrosine-1,
aspartate+32, asparagine+155, tyrosine+104, methionine+222,
glycine+166, histidine+64, glycine+169, phenylalanine+189, serine+33,
serine+221, tyrosine+217, glutamate+156 and/or alanine+152 produce
mutants having changes in the characteristics described above or in
the processing of the enzyme. Note that these amino acid position
numbers are those assigned to B. amyloliquefaciens subtilisin as
seen from Fig. 7. It should be understood that a deletion or
insertion in the N-terminal direction from a given position will
shift the relative amino acid positions so that a residue will not
occupy its original or wild type numerical position. Also, allelic
differences and the variation among various procaryotic species will
result in position shifts, so that position 169 in such subtilisins
will not be occupied by glycine. In such cases the new positions
for glycine will be considered equivalent to and embraced within the
designation glycine+169. The new position for glycine+169 is
readily identified by scanning the subtilisin in question for a
region homologous to glycine+169 in Fig. 7.
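
The scan for a homologous region can be pictured with the following
Python sketch. It is a deliberately crude, ungapped window comparison
given for illustration only and is not a sequence alignment program.
The function name, the flank parameter and the short example sequences
used with it are hypothetical and are not taken from Fig. 7.

```python
# Illustrative sketch: locate the residue in a target sequence that sits in
# the region most similar to the neighbourhood of a reference position.
# Ungapped identity count only; names and parameters are hypothetical.

def equivalent_position(reference, position, target, flank=3):
    """Return the 1-based target position equivalent to the 1-based
    reference position, or None if the target is shorter than the window."""
    idx = position - 1
    start = max(0, idx - flank)
    end = min(len(reference), idx + flank + 1)
    window = reference[start:end]
    offset = idx - start  # where the residue of interest sits in the window
    best_score, best_start = -1, None
    for i in range(len(target) - len(window) + 1):
        candidate = target[i:i + len(window)]
        score = sum(1 for a, b in zip(window, candidate) if a == b)
        if score > best_score:  # keep the first best-scoring window
            best_score, best_start = score, i
    if best_start is None:
        return None
    return best_start + offset + 1
```

With the invented sequences "MKLAGSTVWQ" (reference, glycine at
position 5) and "MKAGSTVWQ" (one residue deleted on the N-terminal
side), the function places the equivalent glycine at position 4.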

One or more, ordinarily up to about 10, amino acid residues may 
be mutated. However, there is no limit to the number of mutations 
that are to be made aside from commercial practicality. 

The enzymes herein may be obtained as salts. It is clear that
the ionization state of a protein will be dependent on the pH of the
surrounding medium, if it is in solution, or of the solution from
which it is prepared, if it is in solid form. Acidic proteins are
commonly prepared as, for example, the ammonium, sodium, or
potassium salts; basic proteins as the chlorides, sulfates, or
phosphates. Accordingly, the present application includes both
electrically neutral and salt forms of the designated carbonyl
hydrolases, and the term carbonyl hydrolase refers to the organic
structural backbone regardless of ionization state.

The mutants are particularly useful in the food processing and
cleaning arts. The carbonyl hydrolases, including mutants, are
produced by fermentation as described herein and recovered by
suitable techniques. See for example K. Aunstrup, 1974, Industrial
Aspects of Biochemistry, ed. B. Spencer pp. 23-46. They are
formulated with detergents or other surfactants in accord with
methods known per se for use in industrial processes, especially
laundry. In the latter case the enzymes are combined with
detergents, builders, bleach and/or fluorescent whitening agents as
is known in the art for proteolytic enzymes. Suitable detergents
include linear alkyl benzene sulfonates, alkyl ethoxylated sulfate,
sulfated linear alcohol or ethoxylated linear alcohol. The
compositions may be formulated in granular or liquid form. See for
example U.S. Patents 3,623,957; 4,404,128; 4,381,247; 4,404,115;
4,318,818; 4,261,868; 4,242,219; 4,142,999; 4,111,855; 4,011,169;
4,090,973; 3,985,686; 3,790,482; 3,749,671; 3,560,392; 3,558,498;
and 3,557,002.

The following disclosure is intended to serve as a
representation of embodiments herein, and should not be construed as
limiting the scope of this application.

Glossary of Experimental Manipulations

In order to simplify the Examples certain frequently occurring
methods will be referenced by shorthand phrases.

Plasmids are designated by a small p preceded and/or followed
by capital letters and/or numbers. The starting plasmids herein are
commercially available, are available on an unrestricted basis, or
can be constructed from such available plasmids in accord with
published procedures.

"Klenow treatment" refers to the process of filling a recessed
3' end of double stranded DNA with deoxyribonucleotides
complementary to the nucleotides making up the protruding 5' end of
the DNA strand. This process is usually used to fill in a recessed
end resulting from a restriction enzyme cleavage of DNA. This
creates a blunt or flush end, as may be required for further
ligations. Treatment with Klenow is accomplished by reacting
(generally for 15 minutes at 15°C) the appropriate complementary
deoxyribonucleotides with the DNA to be filled in under the
catalytic activity (usually 10 units) of the Klenow fragment of
E. coli DNA polymerase I ("Klenow"). Klenow and the other reagents
needed are commercially available. The procedure has been published
extensively. See for example T. Maniatis et al., 1982, Molecular
Cloning, pp. 107-108.

"Digestion" of DNA refers to catalytic cleavage of the DNA with
an enzyme that acts only at certain locations in the DNA. Such
enzymes are called restriction enzymes, and the site for which each
is specific is called a restriction site. "Partial" digestion
refers to incomplete digestion by a restriction enzyme, i.e.,
conditions are chosen that result in cleavage of some but not all of
the sites for a given restriction endonuclease in a DNA substrate.
The various restriction enzymes used herein are commercially
available and their reaction conditions, cofactors and other
requirements as established by the enzyme suppliers were used.
Restriction enzymes commonly are designated by abbreviations
composed of a capital letter followed by other letters and then,
generally, a number representing the microorganism from which each
restriction enzyme originally was obtained. In general, about 1 µg
of plasmid or DNA fragment is used with about 1 unit of enzyme in
about 20 µl of buffer solution. Appropriate buffers and substrate
amounts for particular restriction enzymes are specified by the
manufacturer. Incubation times of about 1 hour at 37°C are
ordinarily used, but may vary in accordance with the supplier's
instructions. After incubation, protein is removed by extraction
with phenol and chloroform, and the digested nucleic acid is
recovered from the aqueous fraction by precipitation with ethanol.
Digestion with a restriction enzyme infrequently is followed with
bacterial alkaline phosphatase hydrolysis of the terminal 5'
phosphates to prevent the two restriction cleaved ends of a DNA
fragment from "circularizing" or forming a closed loop that would
impede insertion of another DNA fragment at the restriction site.
Unless otherwise stated, digestion of plasmids is not followed by 5'
terminal dephosphorylation. Procedures and reagents for
dephosphorylation are conventional (T. Maniatis et al., Id.,
pp. 133-134).

"Recovery" or "isolation" of a given fragment of DNA from a 
restriction digest means separation of the digest on 6 percent
polyacrylamide gel electrophoresis, identification of the fragment
of interest by molecular weight (using DNA fragments of known
molecular weight as markers), removal of the gel section containing
the desired fragment, and separation of the gel from DNA. This
procedure is known generally. For example, see R. Lawn et al.,
1981, "Nucleic Acids Res." 9:6103-6114, and D. Goeddel et al.,
(1980) "Nucleic Acids Res." 8:4057.

"Southern Analysis" is a method by which the presence of DNA 
sequences in a digest or DNA-containing composition is confirmed by
hybridization to a known, labelled oligonucleotide or DNA fragment.
For the purposes herein, Southern analysis shall mean separation of
digests on 1 percent agarose and depurination as described by
G. Wahl et al., 1979, "Proc. Nat. Acad. Sci. U.S.A." 76:3683-3687,
transfer to nitrocellulose by the method of E. Southern, 1975,
"J. Mol. Biol." 98:503-517, and hybridization as described by
T. Maniatis et al., 1978, "Cell" 15:687-701.

"Transformation" means introducing DNA into an organism so that 
the DNA is replicable, either as an extrachromosomal element or
chromosomal integrant. Unless otherwise stated, the method used
herein for transformation of E. coli is the CaCl2 method of Mandel
et al., 1970, "J. Mol. Biol." 53:154, and for Bacillus, the method
of Anagnostopoulos et al., 1961, "J. Bact." 81:741-746.

"Ligation" refers to the process of forming phosphodiester bonds
between two double stranded nucleic acid fragments (T. Maniatis
et al., Id., p. 146). Unless otherwise stated, ligation was
accomplished using known buffers and conditions with 10 units of T4
DNA ligase ("ligase") per 0.5 µg of approximately equimolar amounts
of the DNA fragments to be ligated. Plasmids from the transformants
were prepared, analyzed by restriction mapping and/or sequenced by
the method of Messing et al., 1981, "Nucleic Acids Res." 9:309.

"Preparation" of DNA from transformants means isolating plasmid
DNA from microbial culture. Unless otherwise stated, the
alkaline/SDS method of Maniatis et al., Id., p. 90, was used.

"Oligonucleotides" are short length single or double stranded 
polydeoxynucleotides which were chemically synthesized by the method
of Crea et al., 1980, "Nucleic Acids Res." 8:2331-2348 (except that
mesitylene nitrotriazole was used as a condensing agent) and then
purified on polyacrylamide gels.

All literature citations are expressly incorporated by reference.

Example 1

Preparation of a Genomic DNA Library from B. amyloliquefaciens
and Isolation of its Subtilisin Gene

The known amino acid sequence of the extracellular
B. amyloliquefaciens subtilisin permits the construction of a suitable
probe mixture. The sequence of the mature subtilisin is included (along
with the additional information contributed by the present work) in
Fig. 1. All codon ambiguity for the sequence of amino acids at
positions 117 through 121 is covered by a pool of eight
oligonucleotides of the sequence AA(y)AA(y)ATGGA(y)GT.
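
The size of such a pool follows directly from the number of ambiguous
positions. The Python sketch below is illustrative only. It assumes
that (y) denotes a pyrimidine (C or T), written here as the single
letter Y, and the function and table names are hypothetical.

```python
# Illustrative sketch: enumerate every oligonucleotide covered by a probe
# containing ambiguous positions. Assumes (y) = pyrimidine (C or T).
from itertools import product

# Minimal ambiguity table; only the symbols needed for this probe.
DEGENERATE = {"A": "A", "C": "C", "G": "G", "T": "T", "Y": "CT"}


def expand_probe(pattern):
    """Return all concrete sequences matching a degenerate pattern."""
    choices = [DEGENERATE[symbol] for symbol in pattern.upper()]
    return ["".join(combo) for combo in product(*choices)]


probes = expand_probe("AAYAAYATGGAYGT")
print(len(probes))  # three two-fold ambiguous positions give 2 * 2 * 2 = 8
```

Fixing the first ambiguous position to C or to T splits the eight
sequences into two sub-pools of four members each.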

Chromosomal DNA isolated from B. amyloliquefaciens (ATCC No.
23844) as described by J. Marmur, "J. Mol. Biol.", 31:208, was
partially digested by Sau3A, and the fragments size selected and
ligated into the BamHI site of dephosphorylated pBS42. (pBS42 is a
shuttle vector containing origins of replication effective both in
E. coli and Bacillus. It is prepared as described in Example 4.)
The Sau3A fragment containing vectors were transformed into E. coli
K12 strain 294 (ATCC No. 31446) according to the method of M.
Mandel et al., 1970, "J. Mol. Biol." 53:154 using 80-400 nanograms
of library DNA per 250 µl of competent cells.

Cells from the transformation mixture were plated at a
density of 1-5 x 10* transformants per 150 mm plate containing LB
medium + 12.5 µg/ml chloramphenicol, and grown overnight at 37°C
until visible colonies appeared. The plates were then replica
plated onto BA85 nitrocellulose filters overlaid on LB/chloramphenicol
plates. The replica plates were grown 10-12 hours at 37°C
and the filters transferred to fresh plates containing LB and
150 µg/ml spectinomycin to amplify the plasmid pool.

After overnight incubation at 37°C, filters were processed
essentially as described by Grunstein and Hogness, 1975, "Proc.
Natl. Acad. Sci. (USA)" 72:3961. Out of approximately 20,000
successful transformants, 25 positive colonies were found. Eight of
these positives were streaked to purify individual clones. 24
clones from each streak were grown in microtiter wells, stamped on
to two replica filters, and probed as described above with either
AACAA(y)ATGGA(y)GT (pool 1) or AATAA(y)ATGGA(y)GT (pool 2) which differ
by only one nucleotide. As shown in Figure 2, pool 1 hybridized to a
much greater extent to all positive clones than did pool 2, suggesting
specific hybridization.

Four out of five miniplasmid preparations (Maniatis et al.,
Id.) from positive clones gave identical restriction digest patterns
when digested with Sau3A or HincII. The plasmid isolated from one of
these four identical colonies by the method of Maniatis et al., Id.,
had the entire correct gene sequence and was designated pS4. The
characteristics of this plasmid as determined by restriction analysis
are shown in Figure 3.

Example 2

Expression of the Subtilisin Gene

Bacillus subtilis I-168 (Catalog No. 1-A1, Bacillus Genetic
Stock Center) was transformed with pS4 and a single chloramphenicol
resistant transformant then grown in minimal medium. After 24 hours,
the culture was centrifuged and both the supernatant (10-200 µl) and
pellet assayed for proteolytic activity by measuring the change in
absorbance per minute at 412 nm using 1 ml of the chromogenic substrate
succinyl-L-ala-ala-pro-phe-p-nitroanilide (0.2 µM) in 0.1 M sodium
phosphate (pH 8.0) at 25°C. A B. subtilis I-168 culture transformed
with pBS42 used as a control showed less than 1/200 of the activity
shown by the pS4 transformed culture. Greater than 95 percent of the
protease activity of the pS4 culture was present in the supernatant,
and was completely inhibited by treatment with phenylmethylsulfonyl
fluoride (PMSF) but not by EDTA.

Aliquots of the supernatants were treated with PMSF and EDTA to
inhibit all protease activity and analyzed by 12 percent SDS-PAGE
according to the method of Laemmli, U.K., 1970, "Nature" 227:680. To
prepare the supernatants, 16 µl of supernatant was treated with 1 mM
PMSF, 10 mM EDTA for 10 minutes, and boiled with 4 µl of 5x
concentrated SDS sample buffer minus β-mercaptoethanol. The results
of Coomassie stain on runs using supernatants of cells transformed
with pS4, pBS42, and untransformed B. amyloliquefaciens are shown in
Figure 4. Lane 3 shows authentic subtilisin from B. amyloliquefaciens.
Lane 2, which is the supernatant from pBS42 transformed B. subtilis,
does not give the 31,000 MW band associated with subtilisin which is
exhibited by Lane 1 from pS4 transformed hosts. The approximately
31,000 MW band result for subtilisin is characteristic of the slower
mobility shown by the known M.W. 27,500 subtilisin preparations in
general.

Example 3

Sequencing of the B. amyloliquefaciens Subtilisin Gene

The entire sequence of an EcoRI-BamHI fragment (wherein the EcoRI
site was constructed by conversion of the HincII site) of pS4 was
determined by the method of F. Sanger, 1977, "Proc. Natl. Acad. Sci.
(USA)", 74:5463. Referring to the restriction map shown in Figure 3,
the BamHI-PvuII fragment was found to hybridize with pool 1
oligonucleotides by Southern analysis. Data obtained from sequencing
of this fragment directed the sequencing of the remaining fragments
(e.g. PvuII-HincII and AvaI-AvaI). The results are shown in Figure 1.

Examination of the sequence confirms the presence of codons for
the mature subtilisin corresponding to that secreted by
B. amyloliquefaciens. Immediately upstream from this sequence is a
series of 107 codons beginning with the GTG start codon at -107.
Codon -107 to approximately codon -75 encodes an amino acid sequence
whose characteristics correspond to that of known signal sequences.
(Most such signal sequences are 18-30 amino acids in length, have
hydrophobic cores, and terminate in a small hydrophobic amino acid.)
Accordingly, examination of the sequence data would indicate that
codons -107 to approximately -75 encode the signal sequence; the
remaining intervening codons between -75 and -1 presumably encode a
prosequence.



Example 4

Construction of pBS42

pBS42 is formed by three-way ligation of fragments derived from
pUB110, pC194, and pBR322 (see Figure 5). The fragment from pUB110 is
the approximately 2600 base pair fragment between the HpaII site at
1900 and the BamHI site at 4500 and contains an origin of replication
operable in Bacillus: T. Gryczan et al., 1978, "J. Bacteriol." 134:318;
A. Jalanko et al., 1981, "Gene" 14:325. The BamHI site
was treated with Klenow. The pBR322 portion is the 1100 base pair
fragment between the PvuII site at 2067 and the Sau3A site at 3223
which contains the E. coli origin of replication: F. Bolivar et al.,
1977, "Gene" 2:95; J. Sutcliffe, 1978, Cold Spring Harbor Symposium
43: I, 77. The pC194 fragment is the 1200 base pair fragment between
the HpaII site at 973 and the Sau3A site at 2006 which contains the
gene for chloramphenicol resistance expressible in both E. coli and
B. subtilis: S. Ehrlich, "Proc. Natl. Acad. Sci. (USA)", 74:1680;
S. Horinouchi et al., 1982, "J. Bacteriol." 150:815.

pBS42 thus contains origins of replication operable both in
E. coli and in Bacillus and an expressible gene for chloramphenicol
resistance.

Example 5

Isolation and Sequencing of the B. subtilis Subtilisin Gene

B. subtilis I168 chromosomal DNA was digested with EcoRI and the
fragments resolved on gel electrophoresis. A single 6 kb fragment
hybridized to a [α-32P] CTP nick translation-labelled fragment
obtained from the C-terminus of the subtilisin structural gene in pS4,
described above. The 6 kb fragment was electroeluted and ligated into
pBS42 which had been digested with EcoRI and treated with bacterial
alkaline phosphatase. E. coli ATCC 31446 was transformed with the
ligation mixture and transformants selected by growth on LB agar
containing 12.5 µg chloramphenicol/ml. Plasmid DNA was prepared from
a pooled suspension of 5,000 transformed colonies. This DNA was
transformed into B. subtilis BG84, a protease deficient strain, the
preparation of which is described in Example 8 below. Colonies which
produced protease were screened by plating on LB agar plus 1.5 percent
w/w Carnation powdered nonfat skim milk and 5 µg chloramphenicol/ml
(hereafter termed skim milk selection plates) and observing for zones
of clearance evidencing proteolytic activity.

Plasmid DNA was prepared from protease producing colonies,
digested with EcoRI, and examined by Southern analysis for the
presence of the 6 kb EcoRI insert by hybridization to the
32P-labelled C-terminus fragment of the subtilisin structural gene
from B. amyloliquefaciens. A positive clone was identified and the
plasmid was designated pS168.1. B. subtilis BG84 transformed with
pS168.1 excreted serine protease at a level 5-fold over that produced
in B. subtilis I168. Addition of EDTA to the supernatants did not
affect the assay results, but the addition of PMSF
(phenylmethylsulfonyl fluoride) to the supernatants reduced protease
activity to levels undetectable in the assay described in Example 8
for strain BG84.

A restriction map of the 6.5 kb EcoRI insert is shown in Fig. 6.
The subtilisin gene was localized to within the 2.5 kb KpnI-EcoRI
fragment by subcloning various restriction enzyme digests and testing
for expression of subtilisin in B. subtilis BG84. Southern analysis
with the labelled fragment from the C-terminus of the
B. amyloliquefaciens subtilisin gene as a probe localized the
C-terminus of the B. subtilis gene to within or part of the 631 bp
HincII fragment B in the center of this subclone (see Fig. 6). The
tandem HincII fragments B, C, and D and HincII-EcoRI fragment E
(Fig. 6) were ligated into the M13 vectors mp8 or mp9 and sequenced in
known fashion (J. Messing et al., 1982, "Gene" 19:269-276) using
dideoxy chain termination (F. Sanger et al., 1977, "Proc. Nat. Acad.
Sci. U.S.A." 74:5463-5467). The sequence of this region is shown in
Fig. 7. The first 23 amino acids are believed to be a signal
peptide. The remaining 83 amino acids between the signal sequence and
the mature coding sequence constitute the putative "pro" sequence.
The overlined nucleotides at the 3' end of the gene are believed to be
transcription terminator regions. Two possible Shine-Dalgarno
sequences are underlined upstream from the mature start codon.

Example 6

Manufacture of an Inactivating Mutation of the B. subtilis
Subtilisin Gene

A two step ligation, shown in Fig. 8, was required to construct a
plasmid carrying a defective gene which would integrate into the
Bacillus chromosome. In the first step, pS168.1, which contained the
6.5 kb insert originally recovered from the B. subtilis genomic
library as described in Example 5 above, was digested with EcoRI, the
reaction products treated with Klenow, the DNA digested with HincII,
and the 800 bp EcoRI-HincII fragment E (see Fig. 6) that contains, in
part, the 5' end of the B. subtilis subtilisin gene, was recovered.
This fragment was ligated into pJH101 (pJH101 is available from
J. Hoch (Scripps) and is described by F.A. Ferrari et al., 1983,
"J. Bact." 134:318-329) that had been digested with HincII and treated
with bacterial alkaline phosphatase. The resultant plasmid, pIDV1,
contained fragment E in the orientation shown in Fig. 8. In the
second step, pS168.1 was digested with HincII and the 700 bp HincII
fragment B, which contains the 3' end of the subtilisin gene, was
recovered. pIDV1 was digested at its unique HincII site and
fragment B ligated to the linearized plasmid, transformed in E. coli
ATCC 31446, and selected on LB plates containing 12.5 µg
chloramphenicol/ml or 20 µg ampicillin/ml. One resulting plasmid,
designated pIDV1.4, contained fragment B in the correct orientation
with respect to fragment E. This plasmid pIDV1.4, shown in Fig. 8, is
a deletion derivative of the subtilisin gene containing portions of
the 5' and 3' flanking sequences as well.

B. subtilis BG77, a partial protease-deficient mutant (Prt+/-)
prepared in Example 8 below was transformed with pIDV1.4. Two classes
of chloramphenicol resistant (Cmr) transformants were obtained.
Seventy-five percent showed the same level of proteases as BG77
(Prt+/-) and 25 percent were almost completely protease deficient
(Prt-) as observed by relative zones of clearing on plates containing
LB agar plus skim milk. The Cmr Prt- transformants could not be
due to a single crossover integration of the plasmid at the homologous
regions for fragment E or B because, in such a case, the gene would be
uninterrupted and the phenotype would be Prt+/-. In fact, when
either of fragments E or B were ligated independently into pJH101 and
subsequently transformed into B. subtilis BG77, the protease deficient
phenotype was not observed. The Cmr phenotype of Cmr Prt-
pIDV1.4 transformants was unstable in that Cms Prt- derivatives
could be isolated from Cmr Prt- cultures at a frequency of about
0.1 percent after 10 generations of growth in minimal medium in the
absence of antibiotic selection. One such derivative was obtained and
designated BG2018. The deletion was transferred into 1A84 (a BGSC
strain carrying two auxotrophic mutations flanking the subtilisin gene)
by PBS1 transduction. The derivative organism was designated BG2019.

Example 7

Preparation of a Genomic DNA Library from B. subtilis and
Isolation of its Neutral Protease Gene

The partial amino acid sequence of a neutral protease of
B. subtilis is disclosed by P. Levy et al., 1975, "Proc. Nat. Acad.
Sci. USA" 72:4341-4345. A region of the enzyme (Asp Gln Met Ile Tyr
Gly) was selected from this published sequence in which the least
redundancy existed in the potential codons for the amino acids in the
region. 24 combinations were necessary to cover all the potential
coding sequences, as described below.

GA T   CA A   ATG   AT T   TA T   GG
   C      G            C      C
                       A
Asp    Gln    Met   Ile    Tyr    Gly

Four pools, each containing six alternatives, were prepared as
described above in Example 1. The pools were labelled by
phosphorylation with [γ-32P] ATP.
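
The count of 24 combinations can be checked by enumerating the codon
choices for the selected region. The Python sketch below is
illustrative only. The codon lists are those of the standard genetic
code, only the first two nucleotides of the glycine codon are used, and
the names are hypothetical. How the 24 sequences were divided among the
four pools is not modelled.

```python
# Illustrative sketch: enumerate the candidate coding sequences for
# Asp Gln Met Ile Tyr Gly (Gly limited to its first two nucleotides).
from itertools import product

CODON_CHOICES = [
    ("GAT", "GAC"),         # Asp
    ("CAA", "CAG"),         # Gln
    ("ATG",),               # Met
    ("ATT", "ATC", "ATA"),  # Ile
    ("TAT", "TAC"),         # Tyr
    ("GG",),                # Gly, first two nucleotides only
]


def candidate_probes(choices=CODON_CHOICES):
    """Return every concrete sequence formed by one choice per residue."""
    return ["".join(parts) for parts in product(*choices)]


candidates = candidate_probes()
print(len(candidates))  # 2 * 2 * 1 * 3 * 2 * 1 = 24
```

Each candidate is 17 nucleotides long, and 24 candidates divide evenly
into four pools of six.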

The labelled pool containing sequences conforming closest to a
unique sequence in a B. subtilis genome was selected by digesting
B. subtilis (1A72, Bacillus Genetic Stock Center) DNA with various
restriction enzymes, separating the digests on an electrophoresis
gel, and hybridizing each of the four probe pools to each of the
blotted digests under increasingly stringent conditions until a
single band was seen to hybridize. Increasingly stringent
conditions are those which tend to disfavor hybridization, e.g.,
increases in formamide concentration, decreases in salt
concentration and increases in temperature. At 37°C in a solution
of 5x Denhardt's, 5x SSC, 50 mM NaPO4 pH 6.8 and 20 percent
formamide, only pool 4 would hybridize to a blotted digest. These
were selected as the proper hybridization conditions to be used for
the neutral protease gene and pool 4 was used as the probe.

A lambda library of B. subtilis strain BGSC 1-A72 was prepared
in conventional fashion by partial digestion of the Bacillus genomic
DNA by Sau3A, separation of the partial digest by molecular weight
on an electrophoresis gel, elution of 15-20 kb fragments (R. Lawn
et al., 1981, "Nucleic Acids Res." 9:6103-6114), and ligation of the
fragments to BamHI digested Charon 30 phage using a Packagene kit
from Promega Biotec.

E. coli DP50supF was used as the host for the phage library,
although any known host for Charon lambda phage is satisfactory.
The E. coli host was plated with the library phage and cultured,
after which plaques were assayed for the presence of the neutral
protease gene by transfer to nitrocellulose and screening with probe
pool 4 (Benton and Davis, 1977, "Science" 196:180-182). Positive
plaques were purified through two rounds of single plaque
purification, and two plaques were chosen for further study,
designated λNPRG1 and λNPRG2. DNA was prepared from each phage by
restriction enzyme hydrolysis and separation on electrophoresis
gels. The separated fragments were blotted and hybridized to
labelled pool 4 oligonucleotides. This disclosed that λNPRG1
contained a 2400 bp HindIII hybridizing fragment, but no 4300 bp EcoRI
fragment, while λNPRG2 contained a 4300 bp EcoRI fragment, but no
2400 bp HindIII fragment.

The 2400 bp λNPRG1 fragment was subcloned into the HindIII site
of pJH101 by the following method. λNPRG1 was digested by HindIII,
the digest fractionated by electrophoresis and the 2400 bp fragment
recovered from the gel. The fragment was ligated to alkaline
phosphatase-treated HindIII digested pJH101 and the ligation mixture
used to transform E. coli ATCC 31446 by the calcium chloride shock
method (V. Hershfield et al., 1974, "Proc. Nat. Acad. Sci.
(U.S.A.)" 79:3455-3459). Transformants were identified by selecting
colonies capable of growth on plates containing LB medium plus
12.5 µg chloramphenicol/ml.

Transformant colonies yielded several plasmids. The orientation
of the 2400 bp fragment in each plasmid was determined by
conventional restriction analysis (orientation is the sense reading
or transcriptional direction of the gene fragment in relation to the
reading direction of the expression vector into which it is
ligated). Two plasmids with opposite orientations were obtained and
designated pNPRsubH6 and pNPRsubH1.

The 4300 bp EcoRI fragment of λNPRG2 was subcloned into pBR325
by the method described above for the 2400 bp fragment except that
λNPRG2 was digested with EcoRI and the plasmid was alkaline
phosphatase-treated, EcoRI-digested pBR325. pBR325 is described by
F. Bolivar, 1978, "Gene" 4:121-136. Two plasmids were identified in
which the 4300 bp insert was present in different orientations.
These two plasmids were designated pNPRsubRI and pNPRsubRIb.

Example 8

Characterization of B. subtilis Neutral Protease Gene

The pNPRsubH1 insert was sequentially digested with different
restriction endonucleases and blot hybridized with labelled pool 4
in order to prepare a restriction map of the insert (for general
procedures of restriction mapping see T. Maniatis et al., Id.,
p. 377). A 430 bp RsaI fragment was the smallest fragment that
hybridized to probe pool 4. The RsaI fragment was ligated into the
SmaI site of M13 mp8 (J. Messing et al., 1982, "Gene" 19:269-276 and
J. Messing in Methods in Enzymology, 1983, R. Wu et al., Eds.,
101:20-78) and the sequence determined by the chain-terminating
dideoxy method (F. Sanger et al., 1977, "Proc. Nat. Acad. Sci.
U.S.A." 74:5463-5467). Other restriction fragments from the
pNPRsubH1 insert were ligated into appropriate sites in M13 mp8 or
M13 mp9 vectors and the sequences determined. As required, dITP was
used to reduce compression artifacts (D. Mills et al., 1979, "Proc.
Nat. Acad. Sci. (U.S.A.)" 76:2232-2235). The restriction map for
the pNPRsubH1 fragment is shown in Fig. 9. The sequences of the
various fragments from restriction enzyme digests were compared and
an open reading frame spanning a codon sequence translatable into
the amino and carboxyl termini of the protease (P. Levy et al., Id.)
was determined. An open reading frame is a DNA sequence commencing
at a known point which in reading frame (every three nucleotides)
does not contain any internal termination codons. The open reading
frame extended past the amino terminus to the end of the 2400 bp
HindIII fragment. The 1300 bp BglII - HindIII fragment was prepared
from pNPRsubRIb (which contained the 4300 bp EcoRI fragment of
λNPRG2) and cloned in M13 mp8. The sequence of this fragment, which
contained the portion of the neutral protease leader region not
encoded by the 2400 bp fragment of pNPRsubH1, was determined for 400
nucleotides upstream from the HindIII site.
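
The definition of an open reading frame given above can be
illustrated with a short Python sketch. It is not part of the
disclosed method; the function names and the example sequence are
hypothetical, and the three standard termination codons (TAA, TAG,
TGA) are assumed.

```python
# Minimal sketch: read codons from a chosen start point and stop at the
# first in-frame termination codon. The example sequence is invented
# for illustration and is not taken from Fig. 10.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_reading_frame(dna, start=0):
    """Return the codons read from `start` up to the first stop codon."""
    dna = dna.upper()
    codons = []
    for i in range(start, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if codon in STOP_CODONS:
            break
        codons.append(codon)
    return codons


def is_open(dna, start=0):
    """True when no in-frame stop codon occurs from `start` to the end."""
    n_codons = (len(dna) - start) // 3
    return len(open_reading_frame(dna, start)) == n_codons


example = "ATGGCTGCAGGATAAGCT"  # hypothetical sequence
frame0 = open_reading_frame(example, 0)  # stops at the in-frame TAA
frame1_open = is_open(example, 1)        # no stop codon in this frame
```

Reading the same sequence from a different start point changes which
triplets are examined, which is why the frame must be fixed at a known
point before internal termination codons are looked for.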

The entire nucleotide sequence as determined for this neutral
protease gene, including the putative secretory leader and prepro
sequence, is shown in Fig. 10. The numbers above the line refer to
amino acid positions. The underlined nucleotides in Fig. 10 are
believed to constitute the ribosome binding (Shine-Dalgarno) site,
while the overlined nucleotides constitute a potential hairpin
structure presumed to be a terminator. The first 27 - 28 of the
deduced amino acids are believed to be the signal for the neutral
protease, with a cleavage point at ala-27 or ala-28. The "pro"
sequence of a proenzyme structure extends to the amino-terminal
amino acid (ala-222) of the mature/active enzyme.

A high copy plasmid carrying the entire neutral protease gene
was constructed (Fig. 11) by ligating the BglII fragment of
pNPRsubRI, which contains 1900 bp (Fig. 9), with the PvuII - HindIII
fragment of pNPRsubH1, which contains 1400 bp. pBS42 (from
Example 4) was digested with BamHI and treated with bacterial
alkaline phosphatase to prevent plasmid recircularization.
pNPRsubRI was digested with BglII, the 1900 bp fragment was isolated
from gel electrophoresis and ligated to the open BamHI sites of
pBS42. The ligated plasmid was used to transform E. coli ATCC 31446
by the calcium chloride shock method (V. Hershfield et al., Id.),
and transformed cells selected by growth on plates containing LB
medium with 12.5 µg/ml chloramphenicol. A plasmid having the BglII
fragment in the orientation shown in Fig. 11 was isolated from the
transformants and designated pNPRsubB1. pNPRsubB1 was digested
(linearized) with EcoRI, repaired to flush ends by Klenow treatment
and then digested with HindIII. The larger fragment from the
HindIII digestion (containing the sequence coding for the amino
terminal and upstream regions) was recovered.

The carboxyl terminal region of the gene was supplied by a
fragment from pNPRsubH1, obtained by digestion of pNPRsubH1 with
PvuII and HindIII and recovery of the 1400 bp fragment. The flush
end PvuII and the HindIII site of the 1400 bp fragment were ligated,
respectively, to the blunted EcoRI and the HindIII site of
pNPRsubB1, as shown in Fig. 11. This construct was used to
transform B. subtilis strain BG84, which otherwise excreted no
proteolytic activity by the assays described below. Transformants
were selected on plates containing LB medium plus 1.5 percent
Carnation powdered nonfat milk and 5 µg/ml chloramphenicol. Plasmids
from colonies that cleared a large halo were analyzed. Plasmid
pNPR10, incorporating the structural gene and flanking regions of
the neutral protease gene, was determined by restriction analysis to
have the structure shown in Fig. 11.

B. subtilis strain BG84 was produced by N-methyl-N'-nitro-N-
nitrosoguanidine (NTG) mutagenesis of B. subtilis I168 according to
the general technique of Adelberg et al., 1965, "Biochem. Biophys.
Res. Commun." 18:788-795. Mutagenized strain I168 was plated on
skim milk plates (without antibiotic). Colonies producing a smaller
halo were picked for further analysis. Each colony was
characterized for protease production on skim milk plates and
amylase production on starch plates. One such isolate, which was
partially protease deficient, amylase positive and capable of
sporulation, was designated BG77. The protease deficiency mutation
was designated prt-77. The prt-77 allele was moved to a spo0A
background by congression as described below to produce strain BG84,
a sporulation deficient strain.



Table A

Strain   Relevant Genotype                           Origin

I168     trpC2
JH703    trpC2, pheA12, spo0AΔ677                    Trousdale et al.(a)
BG16     purB6, metB5, leuA8, lys-21, hisA, thr-5,   Pb 1665
         sacA321
BG77     trpC2, prt-77                               NTG x I168
BG81     metB5, prt-77                               BG16 DNA x BG77
BG84     spo0AΔ677, prt-77                           JH703 DNA x BG81

(a) "Mol. Gen. Genetics" 173:61 (1979)



BG84 was completely devoid of protease activity on skim milk
plates and did not produce detectable levels of either subtilisin
or neutral protease when assayed by measuring the change in
absorbance at 412 nm per minute upon incubation with 0.2 µg/ml
succinyl (-L-ala-L-ala-L-pro-L-phe) p-nitroanilide (Vega) in 0.1 M
sodium phosphate, pH 8, at 25°C. BG84 was deposited in the ATCC as
deposit number 39382 on July 21, 1983. Samples for subtilisin assay
were taken from late logarithmic growth phase supernatants of
cultures grown in modified Schaeffer's medium (T. Leighton et al.,
1971, "J. Biol. Chem." 246:3189-3195).

Example 9

Expression of the Neutral Protease Gene

BG84 transformed with pNPR10 was inoculated into minimal media
supplemented with 0.1 percent casein hydrolysate and 10 µg
chloramphenicol and cultured for 16 hours. 0.1 ml of culture
supernatant was removed, added to a suspension of 1.4 mg/ml
Azocoll proteolytic substrate (Sigma) in 10 mM Tris-HCl, 100 mM NaCl,
pH 6.8, and incubated with agitation. Undigested substrate was
removed by centrifugation and the optical density read at 505 nm.
Background values of an Azocoll substrate suspension were
subtracted. The amount of protease excreted by a standard
protease-expressing strain, BG16, was used to establish an arbitrary
level of 100. The results with BG16, and with BG84 transformed with
control and neutral protease gene-containing plasmids, are shown in
Table B in Example 12 below. Transformation of the excreted
protease-devoid B. subtilis strain BG84 results in excretion of
protease activity at considerably greater levels than in BG16, the
wild-type strain.
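
The arbitrary activity scale described above (background subtracted,
reference strain set to 100) amounts to a simple normalization. The
following Python sketch illustrates the arithmetic only; the function
name and the optical density readings are hypothetical and are not
measurements from this disclosure.

```python
# Minimal sketch of the arbitrary-unit scale: subtract the substrate
# background from each optical density reading, then express the result
# relative to the reference strain, which is assigned a level of 100.
def relative_activity(od_sample, od_background, od_reference):
    """Background-corrected activity with the reference strain set to 100."""
    reference = od_reference - od_background
    if reference <= 0:
        raise ValueError("reference reading must exceed the background")
    return 100.0 * (od_sample - od_background) / reference


# Hypothetical readings at 505 nm, for illustration only.
background = 0.05
reference_strain = 0.15
reference_level = relative_activity(reference_strain, background, reference_strain)
blank_level = relative_activity(background, background, reference_strain)
```

A sample whose corrected reading is zero maps to 0 on this scale, which
corresponds to the "not detectable" entries reported for strains
lacking protease activity.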

Example 10 

Manufacture of an Inactivating Mutation of the Neutral Protease 
Gene 

The two RsaI bounded regions in the 2400 bp insert of pNPRsubH1,
totalling 527 bp, can be deleted in order to produce an incomplete
structural gene. The translational products of this gene are
enzymatically inactive. A plasmid having this deletion was
constructed as follows. pJH101 was cleaved by digestion with
HindIII and treated with bacterial alkaline phosphatase. The
fragments of the neutral protease gene to be incorporated into
linearized pJH101 were obtained by digesting pNPRsubH1 with HindIII
and RsaI, and recovering the 1200 bp HindIII-RsaI and 680 bp
RsaI-HindIII fragments by gel electrophoresis. These fragments were
ligated into linearized pJH101 and used to transform E. coli
ATCC 31446. Transformants were selected on plates containing LB
medium and 20 µg ampicillin/ml. Plasmids were recovered from the
transformants and assayed by restriction enzyme analysis to identify
a plasmid having the two fragments in the same orientation as in the
pNPRsubH1 starting plasmid. The plasmid lacking the internal RsaI
fragments was designated pNPRsubH1Δ.

Example 11

Replacement of the Neutral Protease Gene with a Deletion Mutant

Plasmid pNPRsubH1Δ was transformed into B. subtilis strain
BG2019 (the subtilisin deleted mutant from Example 6) and
chromosomal integrants were selected on skim milk plates. Two types
of Cm^r transformants were noted, those with parental levels of
proteolysis surrounding the colony, and those with almost no zone of
proteolysis. Those lacking a zone of proteolysis were picked,
restreaked to purify individual colonies, and their protease
deficient character on skim milk plates confirmed. One of the
Cm^r, proteolysis deficient colonies was chosen for further studies
(designated BG2034). Spontaneous Cm^s revertants of BG2034 were
isolated by overnight growth in LB media containing no Cm, plating
for individual colonies, and replica plating on media with and
without Cm. Three Cm^s revertants were isolated, two of which were
protease proficient, one of which was protease deficient (designated
BG2036). Hybridization analysis of BG2036 confirmed that the
plasmid had been lost from this strain, probably by recombination,
leaving only the deletion fragments of subtilisin and neutral
protease.

Example 12

Phenotype of Strains Lacking Functional Subtilisin and Neutral
Protease

The growth, sporulation and expression of proteases were examined
in strains lacking a functional gene for either the neutral or
alkaline protease or both. The expression of proteases was examined
by a zone of clearing surrounding a colony on a skim milk plate and
by measurement of the protease levels in liquid culture supernatants
(Table B). Strain BG2035, carrying the subtilisin gene deletion,
showed a 30 percent reduction in the level of protease activity and
a normal halo on milk plates. Strain BG2043, carrying the deleted
neutral protease gene and active subtilisin gene, and constructed by
transforming BG16 (Example 8) with DNA from BG2036 (Example 11),
showed an 80 percent reduction in protease activity and only a small
halo on the milk plate. Strain BG2054, considered equivalent to
BG2036 (Example 11) in that it carried the foregoing deletions in
both genes, showed no detectable protease activity in this assay and
no detectable halo on milk plates. The deletion of either or both of
the protease genes had no apparent effect on either growth or
sporulation. Strains carrying these deletions had normal growth
rates on both minimal glucose and LB media. The strains sporulated
at frequencies comparable to the parent strain BG16. Examination of
morphology of these strains showed no apparent differences from
strains without such deletions.

Table B

Effect of protease deletions on protease expression and sporulation.

Strain         Genotype(a)          Protease      Percent
                                    activity(b)   Sporulation

BG16           Wild type            100           40
BG2035         aprΔ684              70            20
BG2043         nprEΔ522             20            20
BG2054         aprΔ684, nprEΔ522    ND            45
BG84(pBS42)    spo0AΔ677, prt-77    ND
BG84(pNPR10)   spo0AΔ677, prt-77    3000

(a) Only the loci relevant to the protease phenotype are shown.
(b) Protease activity is expressed in arbitrary units; BG16 was
    assigned a level of 100. ND indicates the level of protease was
    not detectable in the assay used.

Example 13 

Site-specific Saturation Mutagenesis of the B. amyloliquefaciens
Subtilisin Gene at Position 222; Preparation of the Gene for
Cassette Insertion

pS4-5, a derivative of pS4 made according to Wells et al.,
"Nucleic Acids Res.", 1983, 11:7911-7924, was digested with EcoRI and
BamHI, and the 1.5 kb EcoRI-BamHI fragment recovered. This fragment
was ligated into replicative form M-13 mp9 which had been digested
with EcoRI and BamHI (Sanger et al., 1980, "J. Mol. Biol." 143,
161-178; Messing et al., 1981, "Nucleic Acids Research" 9, 304-321;
Messing, J. and Vieira, J. (1982) Gene 19, 269-276). The M-13 mp9
phage ligations, designated M-13 mp9 SUBT, were used to transform
E. coli strain JM101 and single stranded phage DNA was prepared from
a two ml overnight culture. An oligonucleotide primer was
synthesized having the sequence

5'-GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC-3'. This primer conforms
to the sequence of the subtilisin gene fragment encoding amino acids
216-232 except that the 10 bp of codons for amino acids 222-225 were
deleted, and the codons for amino acids 220, 227 and 228 were
mutated to introduce a KpnI site 5' to the met-222 codon and a PstI
site 3' to the met+222 codon. See Fig. 12. Substituted nucleotides
are denoted by asterisks, the underlined codons in line 2 represent
the new restriction sites and the scored sequence in line 4
represents the inserted oligonucleotides. The primer (about 15 µM)
was labelled with [32P] by incubation with [γ-32P]-ATP (10 µl in
20 µl reaction) (Amersham 5000 Ci/mmol, 10218) and T4
polynucleotide kinase (10 units) followed by nonradioactive ATP
(100 µM) to allow complete phosphorylation of the mutagenesis
primer. The kinase was inactivated by heating the phosphorylation
mixture at 68°C for 15 min.
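
As a check on the primer design described above, the following Python
sketch locates the two new restriction sites within the primer
sequence. The recognition sequences used for KpnI (GGTACC) and PstI
(CTGCAG) are the standard ones; the function name is hypothetical and
the sketch is illustrative rather than part of the disclosed method.

```python
# Minimal sketch: locate the KpnI and PstI recognition sequences in the
# mutagenesis primer quoted in the text. Positions are 0-based offsets
# from the 5' end of the primer.
PRIMER = "GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC"
SITES = {"KpnI": "GGTACC", "PstI": "CTGCAG"}


def find_sites(sequence, sites):
    """Map each enzyme name to the start positions of its recognition site."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = sequence.find(motif)
        while start != -1:
            positions.append(start)
            start = sequence.find(motif, start + 1)
        hits[enzyme] = positions
    return hits


primer_sites = find_sites(PRIMER, SITES)
```

Each site occurs exactly once in the primer, with the KpnI site lying
5' to the PstI site, in keeping with the arrangement described for
Fig. 12.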

The primer was hybridized to M-13 mp9 SUBT as modified from
Norris et al., 1983, "Nucleic Acids Res." 11, 5103-5112 by combining
5 µl of the labelled mutagenesis primer (~3 µM), ~1 µg M-13 mp9 SUBT
template, 1 µl of 1 µM M-13 sequencing primer (17-mer), and 2.5 µl
of buffer (0.3 M Tris pH 8, 40 mM MgCl2, 12 mM EDTA, 10 mM DTT,
0.5 mg/ml BSA). The mixture was heated to 68°C for 10 minutes and
cooled 10 minutes at room temperature. To the annealing mixture was
added 3.6 µl of 0.25 mM dGTP, dCTP, dATP, and dTTP, 1.25 µl of 10 mM
ATP, 1 µl ligase (4 units) and 1 µl Klenow (5 units). The primer
extension and ligation reaction (total volume 25 µl) proceeded
2 hours at 14°C. The Klenow and ligase were inactivated by heating
to 68°C for 20 min. The heated reaction mixture was digested with
BamHI and EcoRI and an aliquot of the digest was applied to a 6
percent polyacrylamide gel and radioactive fragments were visualized
by autoradiography. This showed the [32P] mutagenesis primer had
indeed been incorporated into the EcoRI-BamHI fragment containing
the now mutated subtilisin gene.

The remainder of the digested reaction mixture was diluted to
200 µl with 10 mM Tris, pH 8, containing 1 mM EDTA, extracted once
with a 1:1 (v:v) phenol/chloroform mixture, then once with
chloroform, and the aqueous phase recovered. 15 µl of 5M ammonium
acetate (pH 8) was added along with two volumes of ethanol to
precipitate the DNA from the aqueous phase. The DNA was pelleted by
centrifugation for five minutes in a microfuge and the supernatant
was discarded. 300 µl of 70 percent ethanol was added to wash the
DNA pellet, the wash was discarded and the pellet lyophilized.

pBS42 from Example 4 above was digested with BamHI and EcoRI and
purified on an acrylamide gel to recover the vector. 0.5 µg of the
digested vector, 50 µM ATP and 6 units ligase were dissolved in 20 µl
of ligation buffer. The ligation proceeded overnight at 14°C. The DNA
was transformed into E. coli 294 rec+ and the transformants grown
in 4 ml of LB medium containing 12.5 µg/ml chloramphenicol. Plasmid
DNA was prepared from this culture and digested with KpnI, EcoRI and
BamHI. Analysis of the restriction fragments showed 30-50 percent
of the molecules contained the expected KpnI site programmed by the
mutagenesis primer. It was hypothesized that the plasmid population
not including the KpnI site resulted from M-13 replication before
bacterial repair of the mutagenesis site, thus producing a
heterogeneous population of KpnI+ and KpnI- plasmids in some of
the transformants. In order to obtain a pure culture of the KpnI+
plasmid, the DNA was transformed a second time into E. coli to clone
plasmids containing the new KpnI site. DNA was prepared from 16
such transformants and six were found to contain the expected KpnI
site.

Preparative amounts of DNA were made from one of these six
transformants (designated pΔ222) and restriction analysis confirmed
the presence and location of the expected KpnI and PstI sites. 40
µg of pΔ222 were digested in 300 µl of KpnI buffer plus 30 µl KpnI
(300 units) for 1.5 h at 37°C. The DNA was precipitated with
ethanol, washed with 70 percent ethanol, and lyophilized. The DNA
pellet was taken up in 200 µl HindIII buffer and digested with 20 µl
(500 units) PstI for 1.5 h at 37°C. The aqueous phase was extracted
with phenol/CHCl3 and the DNA precipitated with ethanol. The DNA
was dissolved in water and purified by polyacrylamide gel
electrophoresis. Following electroelution of the vector band (120 V
for 2 h at 0°C in 0.1 times TBE (Maniatis et al., Id.)) the DNA was
purified by phenol/CHCl3 extraction, ethanol precipitation and
ethanol washing.

Although pΔ222 could be digested to completion (>98 percent) by
either KpnI or PstI separately, exhaustive double digestion was
incomplete (<50 percent). This may have resulted from the fact
that these sites were so close (10 bp) that digestion by KpnI
allowed "breathing" of the DNA in the vicinity of the PstI site,
i.e., strand separation or fraying. Since PstI will only cleave
double stranded DNA, strand separation could inhibit subsequent PstI
digestion.

Example 14 

Ligation of Oligonucleotide Cassettes into the Subtilisin Gene

10 µM of four complementary oligonucleotide pools (A-D, Table C
below) which were not 5' phosphorylated were annealed in 20 µl
ligase buffer by heating for five minutes at 68°C and then cooling
for fifteen minutes at room temperature. 1 µM of each annealed
oligonucleotide pool, ~0.2 µg KpnI and PstI-digested pΔ222 obtained
in Example 13, 0.5 mM ATP, ligase buffer and 6 units T4 DNA ligase
in 20 µl total volume was reacted overnight at 14°C to ligate the
pooled cassettes in the vector. A large excess of cassettes (~300x
over the pΔ222 ends) was used in the ligation to help prevent
intramolecular KpnI-KpnI ligation. The reaction was diluted by
adding 25 µl of 10 mM Tris pH 8 containing 1 mM EDTA. The mixture
was reannealed to avoid possible cassette concatemer formation by
heating to 68°C for five minutes and cooling for 15 minutes at room
temperature. The ligation mixtures from each pool were transformed
separately into E. coli 294 rec+ cells. A small aliquot from each
transformation mixture was plated to determine the number of
independent transformants. The large number of transformants
indicated a high probability of multiple mutagenesis. The rest of
the transformants (~200-400 transformants) were cultured in 4 ml of
LB medium plus 12.5 µg chloramphenicol/ml. DNA was prepared from
each transformation pool (A-D). This DNA was digested with KpnI,
~0.1 µg was used to retransform E. coli rec+ and the mixture was
plated to isolate individual colonies from each pool. Ligation of
the cassettes into the gene and bacterial repair upon transformation
destroyed the KpnI and PstI sites. Thus, only pΔ222 was cut when
the transformant DNA was digested with KpnI. The cut plasmid would
not transform E. coli. Individual transformants were grown in
culture and DNA was prepared from 24 to 26 transformants per pool
for direct plasmid sequencing. A synthetic oligonucleotide primer
having the sequence 5'-GAGCTTGATGTCATGGC-3' was used to prime the
dideoxy sequencing reaction. The mutants which were obtained are
described in Table C below.

Two codon+222 mutants (i.e., gln and ile) were not found after
the screening described. To obtain these a single 25mer
oligonucleotide was synthesized for each mutant corresponding to the
top oligonucleotide strand in Figure 12. Each was phosphorylated
and annealed to the bottom strand of its respective
nonphosphorylated oligonucleotide pool (i.e., pool A for gln and
pool D for ile). This was ligated into KpnI and PstI digested pΔ222
and processed as described for the original oligonucleotide pools.
The frequency of appearance for single mutants obtained this way was
2/8 and 0/7 for gln and ile, respectively. To avoid this apparent
bias the top strand was phosphorylated and annealed to its
unphosphorylated complementary pool. The heterophosphorylated
cassette was ligated into cut pΔ222 and processed as before. The
frequency of appearance of gln and ile mutants was now 7/7 and 7/7,
respectively.

The data in Table C demonstrate a bias in the frequency of
mutants obtained from the pools. This probably resulted from
unequal representation of oligonucleotides in the pool. This may
have been caused by unequal coupling of the particular trimers over
the mutagenesis codon in the pool. Such a bias problem could be
remedied by appropriate adjustment of trimer levels during synthesis
to reflect equal reaction. In any case, mutants which were not
isolated in the primary screen were obtained by synthesizing a
single strand oligonucleotide representing the desired mutation,
phosphorylating both ends, annealing to the pool of
non-phosphorylated complementary strands and ligating into the
cassette site. A biased heteroduplex repair observed for the
completely unphosphorylated cassette may result from the fact that
position 222 is closer to the 5' end of the upper strand than it is
to the 5' end of the lower strand (see Figure 12). Because a gap
exists at the unphosphorylated 5' ends and the mismatch bubble in
the double stranded DNA is at position 222, excision repair of the
top strand gap would more readily maintain a circularly hybridized
duplex capable of replication. Consistent with this hypothesis is
the fact that the top strand could be completely retained by
selective 5' phosphorylation. In this case only the bottom strand
contained a 5' gap which could promote excision repair. This method
is useful in directing biased incorporation of synthetic
oligonucleotide strands when employing mutagenic oligonucleotide
cassettes.



Example 15 

Site-Specific Mutagenesis of the Subtilisin Gene at Position 166 

The procedure of Examples 13-14 was followed in substantial
detail, except that the mutagenesis primer differed (the 37 mer
shown in Fig. 13 was used), the two restriction enzymes were SacI
and XmaIII rather than PstI and KpnI and the resulting constructions
differed, as shown in Fig. 13.

Bacillus strains excreting mutant subtilisins at position 166
were obtained as described below in Example 16. The mutant
subtilisins exhibiting substitutions of ala, asp, gln, phe, his,
lys, asn, arg, and val for the wild-type residue were recovered.

Example 16

Preparation of Mutant Subtilisin Enzymes

B. subtilis strain BG2036 obtained by the method of Example 11
was transformed by the plasmids of Examples 14, 15 or 20 and by
pS4-5 as a control. Transformants were plated or cultured in shaker
flasks for 16 to 48 h at 37°C in LB media plus 12.5 µg/ml
chloramphenicol. Mutant enzymatically active subtilisin was
recovered by dialyzing cell broth against 0.01M sodium phosphate
buffer, pH 6.2. The dialyzed broth was then titrated to pH 6.2 with
1N HCl and loaded on a 2.5 x 2 cm column of CM cellulose (CM-52
Whatman). After washing with 0.01M sodium phosphate, pH 6.2, the
subtilisins (except mutants at position +222) were eluted with the
same buffer made 0.08N in NaCl. The mutant subtilisins at position
+222 were each eluted with 0.1M sodium phosphate, pH 7.0. The
purified mutant and wild type enzymes were then used in studies of
oxidation stability, Km, Kcat, Kcat/Km ratio, pH optimum, and
changes in substrate specificity.



Table C  Oligonucleotide Pool Organization
and Frequency of Mutants Obtained

Pool   Amino Acid              Codon-222(a)   Frequency(b)

A      asp                     GAT            2/25
       met                     ATG            3/25
       cys                     TGT            13/25
       arg                     AGA            2/25
       gln                     CAA            0/25
       unexpected mutants(c)                  5/25

B      leu                     CTT            1/25
       pro                     CCT            3/25
       phe                     TTC            6/25
       tyr                     TAC            5/25
       his                     CAC            1/25
       unexpected mutants                     9/25

C      glu                     GAA            3/17
       ala                     GCT            3/17
       thr                     ACA            1/17
       lys                     AAA            1/17
       asn                     AAC            1/17
       unexpected mutants                     8/17

D      gly                     GGC            1/23
       trp                     TGG            8/23
       ile                     ATC            0/23
       ser                     AGC            1/23
       val                     GTT            4/23
       unexpected mutants                     9/23

(a) Codons were chosen based on frequent use in the cloned
    subtilisin gene sequence (Wells et al., 1983, Id.).

(b) Frequency was determined from single track analysis by direct
    plasmid sequencing.

(c) Unexpected mutants generally comprised double mutants with
    changes in codons next to 222 or at the points of ligation.
    These were believed to result from impurities in the
    oligonucleotide pools and/or erroneous repair of the gapped
    ends.
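
The bias noted for Table C can be made explicit by comparing each
codon's share of the expected-codon isolates in a pool with the one in
five share that equal representation would give. The Python sketch
below uses the counts from Table C; the function name and the choice
to exclude the unexpected mutants from the denominator are
illustrative assumptions, not part of the original analysis.

```python
# Minimal sketch: share of each programmed codon among the isolates of a
# pool that carried one of the five programmed codons. Counts are the
# numerators and denominators reported in Table C.
TABLE_C = {
    "A": {"asp": 2, "met": 3, "cys": 13, "arg": 2, "gln": 0, "unexpected": 5},
    "B": {"leu": 1, "pro": 3, "phe": 6, "tyr": 5, "his": 1, "unexpected": 9},
    "C": {"glu": 3, "ala": 3, "thr": 1, "lys": 1, "asn": 1, "unexpected": 8},
    "D": {"gly": 1, "trp": 8, "ile": 0, "ser": 1, "val": 4, "unexpected": 9},
}


def codon_shares(pool):
    """Fraction of expected-codon isolates contributed by each codon."""
    expected = {aa: n for aa, n in TABLE_C[pool].items() if aa != "unexpected"}
    total = sum(expected.values())
    return {aa: n / total for aa, n in expected.items()}


shares_a = codon_shares("A")
pool_totals = {pool: sum(counts.values()) for pool, counts in TABLE_C.items()}
```

Under equal representation each codon in a pool would account for
about 0.2 of the expected-codon isolates; in pool A the cys codon
accounts for 13 of 20, while gln was not recovered at all.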




Example 17 

Mutant Subtilisin Exhibiting Improved Oxidation Stability 

Subtilisins having cysteine and alanine substituted at the 222
position for wild-type methionine (Example 16) were assayed for
resistance to oxidation by incubating with various concentrations of
sodium hypochlorite (Clorox bleach).

To a total volume of 400 µl of 0.1M, pH 7, NaPO4 buffer
containing the indicated bleach concentrations (Fig. 14) sufficient
enzyme was added to give a final concentration of 0.016 mg/ml of
enzyme. The solutions were incubated at 25°C for 10 min. and
assayed for enzyme activity as follows: 120 µl of either ala+222 or
wild type, or 100 µl of the cys+222 incubation mixture was combined
with 890 µl 0.1M tris buffer at pH 8.6 and 10 µl of a sAAPFpN
(Example 18) substrate solution (20 mg/ml in DMSO). The rate of
increase in absorbance at 410 nm due to release of p-nitroaniline
(Del Mar, E.G., et al., 1979 "Anal. Biochem." 99, 316-320) was
monitored. The results are shown in Fig. 14. The alanine
substitution produced considerably more stable enzyme than either
the wild-type enzyme or a mutant in which a labile cysteine residue
was substituted for methionine. Surprisingly, the alanine
substitution did not substantially interfere with enzyme activity
against the assay substrate, yet conferred relative oxidation
stability on the enzyme. The serine+222 mutant also exhibited
improved oxidation stability.

Example 18 

Mutant Subtilisins Exhibiting Modified Kinetics and Substrate
Specificity

Various mutants for glycine+166 were screened for modified
Kcat, Km and Kcat/Km ratios. Kinetic parameters were obtained by
analysis of the progress curves of the reactions. The rate of
reaction was measured as a function of substrate concentration. Data
were analyzed by fitting to the Michaelis-Menten equation using the
non-linear regression algorithm of Marquardt (Marquardt, D. W. 1963,
"J. Soc. Ind. Appl. Math." 11, 431-41). All reactions were
conducted at 25°C in 0.1M tris buffer, pH 8.6, containing
benzoyl-L-Valyl-Glycyl-L-Arginyl-p-nitroanilide (BVGRpN; Vega
Biochemicals) at initial concentrations of 0.0025 M to 0.00026 M
(depending on the value of Km for the enzyme of interest -
concentrations were adjusted in each measurement so as to exceed Km)
or succinyl-L-Alanyl-L-Alanyl-L-Prolyl-L-Phenylalanyl-p-nitro-
anilide (sAAPFpN; Vega Biochemicals) at initial concentrations of
0.0010 M to 0.00028 M (varying as described for BVGRpN).
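
The kind of fit described above can be sketched in Python as a damped
least-squares (Marquardt-style) fit of the Michaelis-Menten equation
v = Vmax*S/(Km + S). This is an illustrative reimplementation under
stated assumptions, not the program used for the reported results: the
function names, starting guesses and damping schedule are hypothetical,
and the rate data are synthetic values generated from the wild-type
parameters so that the fit can be checked.

```python
# Minimal sketch of a Marquardt-style damped least-squares fit of
# v = vmax * s / (km + s). Data below are synthetic, generated from
# kcat = 37 and Km = 1.4e-4 M purely to exercise the routine.
def mm_rate(s, vmax, km):
    return vmax * s / (km + s)


def fit_michaelis_menten(substrate, rates, max_iter=200):
    vmax = max(rates)                            # crude starting guesses
    km = sorted(substrate)[len(substrate) // 2]
    damping = 1e-3

    def sse(v, k):
        return sum((r - mm_rate(s, v, k)) ** 2 for s, r in zip(substrate, rates))

    best = sse(vmax, km)
    for _ in range(max_iter):
        a11 = a12 = a22 = g1 = g2 = 0.0
        for s, r in zip(substrate, rates):
            d_vmax = s / (km + s)                # partial derivative w.r.t. vmax
            d_km = -vmax * s / (km + s) ** 2     # partial derivative w.r.t. km
            resid = r - mm_rate(s, vmax, km)
            a11 += d_vmax * d_vmax
            a12 += d_vmax * d_km
            a22 += d_km * d_km
            g1 += d_vmax * resid
            g2 += d_km * resid
        # Normal equations with the diagonal scaled by (1 + damping).
        m11 = a11 * (1.0 + damping)
        m22 = a22 * (1.0 + damping)
        det = m11 * m22 - a12 * a12
        if det == 0.0:
            break
        step_v = (g1 * m22 - g2 * a12) / det
        step_k = (g2 * m11 - g1 * a12) / det
        trial_v, trial_k = vmax + step_v, km + step_k
        trial = sse(trial_v, trial_k) if trial_k > 0 else float("inf")
        if trial < best:                         # accept step, relax damping
            vmax, km, best = trial_v, trial_k, trial
            damping /= 10.0
        else:                                    # reject step, damp harder
            damping *= 10.0
            if damping > 1e12:
                break
    return vmax, km


substrate = [0.00028, 0.0004, 0.0006, 0.0008, 0.0010]
rates = [mm_rate(s, 37.0, 1.4e-4) for s in substrate]
fitted_vmax, fitted_km = fit_michaelis_menten(substrate, rates)
```

With real measurements the rates carry noise, so the fitted values
would come with uncertainties that this sketch does not estimate.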

The results obtained in these experiments were as follows:

Table D

Substrate   Enzyme                Kcat (s^-1)   Km (M)         Kcat/Km

sAAPFpN     gly-166 (wild type)   37            1.4 x 10^-4    3 x 10^5
            ala+166               19            2.7 x 10^-5    7 x 10^5
            asp+166               3             5.8 x 10^-4    5 x 10^3
            glu+166               11            3.4 x 10^-4    3 x 10^4
            phe+166               3             1.4 x 10^-5    2 x 10^5
            his+166               15            1.1 x 10^-4    1 x 10^5
            lys+166               15            3.4 x 10^-5    4 x 10^5
            asn+166               26            1.4 x 10^-4    2 x 10^5
            arg+166               19            6.2 x 10^-5    3 x 10^5
            val+166               1             1.4 x 10^-4    1 x 10^4

BVGRpN      Wild type             2             1.1 x 10^-3    2 x 10^3
            asp+166               2             4.1 x 10^-5    5 x 10^4
            glu+166               2             2.7 x 10^-5    7 x 10^4
            asn+166               1             1.2 x 10^-4    8 x 10^3

The Kcat/Km ratio for each of the mutants varied from that of
the wild-type enzyme. As a measure of catalytic efficiency, these
ratios demonstrate that enzymes having much higher activity against
a given substrate can be readily designed and selected by screening
in accordance with the invention herein. For example, A166 exhibits
over 2 times the activity of the wild type on sAAPFpN.

These data also demonstrate changes in substrate specificity
upon mutation of the wild type enzyme. For example, the Kcat/Km
ratio for the D166 and E166 mutants is higher than the wild type
enzyme with the BVGRpN substrate, but qualitatively opposite results
were obtained upon incubation with sAAPFpN. Accordingly, the D166
and E166 mutants were relatively more specific for BVGRpN than for
sAAPFpN.
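
The comparisons drawn above follow directly from the Kcat and Km
values in Table D. The Python sketch below recomputes the Kcat/Km
ratios and the fold change relative to wild type for a few of the
entries; the function names are hypothetical and only values from
Table D are used.

```python
# Minimal sketch: recompute Kcat/Km and the fold change over wild type
# from selected (Kcat in s^-1, Km in M) pairs listed in Table D.
TABLE_D = {
    "sAAPFpN": {
        "wild type": (37, 1.4e-4),
        "ala+166": (19, 2.7e-5),
        "asp+166": (3, 5.8e-4),
        "glu+166": (11, 3.4e-4),
    },
    "BVGRpN": {
        "wild type": (2, 1.1e-3),
        "asp+166": (2, 4.1e-5),
        "glu+166": (2, 2.7e-5),
    },
}


def efficiency(substrate, enzyme):
    """Catalytic efficiency Kcat/Km for one enzyme on one substrate."""
    kcat, km = TABLE_D[substrate][enzyme]
    return kcat / km


def fold_over_wild_type(substrate, enzyme):
    """Kcat/Km of a mutant divided by that of wild type on the same substrate."""
    return efficiency(substrate, enzyme) / efficiency(substrate, "wild type")


ala_fold = fold_over_wild_type("sAAPFpN", "ala+166")
asp_on_bvgrpn = fold_over_wild_type("BVGRpN", "asp+166")
asp_on_saapfpn = fold_over_wild_type("sAAPFpN", "asp+166")
```

A fold change above 1 on BVGRpN together with a fold change below 1 on
sAAPFpN is the pattern described for the D166 and E166 mutants.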

15 

Example 19 

Mutant Subtilisin Exhibiting Modified pH-Activity Profile 

20 The pH profile of the Cys+222 mutant obtained inExample 16 was 
compared to that of the wild type enzyme. 10ul of 60 mg/ml sAAPFpN 
in DMSO, 10 ul of Cys+222 (0.18 mg/ml) or wild type (0.5 mg/ml) and 
980 ul of buffer (for measurements at pH 6.6, 7.0 and 7.6, 0.1M 
NaP0 4 buffer; at pH 8.2, 8.6 and 9.2, 0.1M tris buffer; and at pH 

25 9.6 and 10.0, 0.1M glycine buffer), after which the initial rate of 
change in absorbance at 410 nm per minute was measured at each pH 
and the data plotted in Fig. 15. The Cys+222 mutant exhibits a 
sharper pH optimum than the wild type enzyme. 

Example 20

Site-Specific Mutagenesis of the Subtilisin Gene at Position 169

The procedure of Examples 13-14 was followed in substantial
detail, except that the mutagenesis primer differed (the primer
shown in Fig. 16 was used), the two restriction enzymes were KpnI
and EcoRV rather than PstI and KpnI and the resulting constructions
differed, as shown in Fig. 16.

Bacillus strains excreting mutant subtilisins at position 169
were obtained as described in Example 16. The mutant
subtilisins exhibiting substitutions of ala and ser for the
wild-type residue were recovered and assayed for changes in kinetic
features. The assay employed sAAPFpN at pH 8.6 in the same fashion
as set forth in Example 18. The results were as follows:

Table E 

Enzyme     Kcat (s^-1)    Km (M)          Kcat/Km

ala+169    58             7.5 x 10^-5     8 x 10^5
ser+169    38             8.5 x 10^-5     4 x 10^5


Example 21 

Alterations in Specific Activity on a Protein Substrate 



Position 166 mutants from Examples 15 and 16 were assayed for
alteration of specific activity on a naturally occurring protein
substrate. Because these mutant proteases could display altered
specificity as well as altered specific activity, the substrate
should contain sufficient different cleavage sites, i.e., acidic,
basic, neutral, and hydrophobic, so as not to bias the assay toward
a protease with one type of specificity. The substrate should also
contain no derivatized residues that result in the masking of
certain cleavage sites. The widely used substrates such as
hemoglobin, azocollagen, azocasein, dimethyl casein, etc., were
rejected on this basis. Bovine casein, α and α2 chains, was
chosen as a suitable substrate.

A 1 percent casein (w/v) solution was prepared in a 100 mM Tris
buffer, pH 8.0, 10 mM EDTA. The assay protocol is as follows:

790 ul 50 mM Tris pH 8.2
100 ul 1 percent casein (Sigma) solution
10 ul test enzyme (10-200 ng).

This assay mixture was mixed and allowed to incubate at room
temperature for 20 minutes. The reaction was terminated upon the
addition of 100 ul 100 percent trichloroacetic acid, followed by
incubation for 15 minutes at room temperature. The precipitated
protein was pelleted by centrifugation and the optical density of
the supernatant was determined spectrophotometrically at 280 nm.
The optical density is a reflection of the amount of unprecipitated,
i.e., hydrolyzed, casein in the reaction mixture. The amount of
casein hydrolyzed by each mutant protease was compared to a series
of standards containing various amounts of the wild type protease,
and the activity is expressed as a percentage of the corresponding
wild type activity. Enzyme activities were converted to specific
activity by dividing the casein hydrolysis activity by the 280 nm
absorbance of the enzyme solution used in the assay.

All of the mutants which were assayed showed less specific
activity on casein than the wild type with the exception of Asn+166,
which was 26 percent more active on casein than the wild type. The
mutant showing the least specific activity was ile+166, at 0.184 of
the wild type activity.
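
The conversion from supernatant optical density to specific activity described above can be sketched as follows. This is one possible reading of the procedure, not a statement of how the calculation was actually performed: it assumes linear interpolation between wild type standards, and the standard-curve values, function names and example readings are all hypothetical.

```python
def interpolate(x, points):
    """Piecewise-linear interpolation over (x, y) points, sorted by x."""
    pts = sorted(points)
    if len(pts) < 2:
        raise ValueError("need at least two standards")
    if not pts[0][0] <= x <= pts[-1][0]:
        raise ValueError("reading lies outside the standard curve")
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if x0 <= x <= x1:
            if x1 == x0:
                return y0
            return y0 + (x - x0) * (y1 - y0) / (x1 - x0)


# Hypothetical standards: (supernatant OD at 280 nm, ng wild type protease).
STANDARDS = [(0.05, 0.0), (0.15, 50.0), (0.25, 100.0), (0.45, 200.0)]


def wild_type_equivalent(od280_supernatant, standards=STANDARDS):
    """Amount of wild type protease giving the same casein hydrolysis."""
    return interpolate(od280_supernatant, standards)


def specific_activity(activity, enzyme_a280):
    """Casein hydrolysis activity divided by the A280 of the enzyme solution."""
    if enzyme_a280 <= 0:
        raise ValueError("enzyme A280 must be positive")
    return activity / enzyme_a280


if __name__ == "__main__":
    activity = wild_type_equivalent(0.20)  # hypothetical mutant reading
    print(activity, specific_activity(activity, 0.5))
```

Dividing a mutant's specific activity by that of the wild type measured the same way gives a relative figure of the kind quoted for ile+166.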



CLAIMS

1. A method of preparing a procaryotic carbonyl hydrolase
which comprises: culturing a recombinant host cell
transformed with an expression vector comprising the DNA
sequence encoding the hydrolase, and recovering the
hydrolase from the cell culture.

2. The method of claim 1 wherein the hydrolase is a
protease, preferably a subtilisin or a metalloprotease, most
preferably a subtilisin, the DNA sequence preferably
encoding subtilisin in the form of prosubtilisin or
preprosubtilisin.

3. The method of claim 1 wherein the recombinant host
cell is a strain of Bacillus, preferably a Bacillus
subtilis.

4. The method of claim 2 wherein the DNA sequence
encoding subtilisin is operably linked to its native
promoter, to a promoter homologous to the host which is
other than the native promoter, or to a promoter which is
heterologous to the host.

5. The process of claim 2 wherein the recombinant host
cell was transformed with an effective expression vector for
the DNA sequence encoding the procaryotic protease operably
linked to its signal sequence.

6. A composition comprising a procaryotic carbonyl
hydrolase, preferably a Bacillus hydrolase, and a host
microorganism transformed so as to be capable of expressing
the hydrolase.

7. A composition comprising prepro-, pre- or procarbonyl
hydrolase, preferably prosubtilisin, essentially free of
cells which express said prepro-, pre- or procarbonyl
hydrolase.

8. A liquid detergent composition comprising B.
amyloliquefaciens subtilisin.

9. An expression vector for a procaryotic carbonyl
hydrolase which comprises a DNA sequence encoding the
hydrolase operably linked to a promoter compatible with a
suitable host cell.

10. A recombinant expression vector comprising a DNA
sequence encoding a prepro- or procarbonyl hydrolase,
preferably subtilisin, operably linked to a promoter
compatible with a suitable host cell; or a cell, preferably
a strain of Bacillus, transformed by a said vector.
11. A method comprising:

(a) isolating a DNA moiety encoding a procaryotic
carbonyl hydrolase;

(b) introducing a mutation into a predetermined region
in the DNA which, upon expression of the DNA, results in the
substitution, deletion or insertion of at least one amino
acid at a predetermined site in the hydrolase;

and optionally

(c) transforming a suitable host with the mutated DNA
of step (b) and recovering the expression product of the
mutated DNA.

12. The method of claim 11 wherein the mutation is
predetermined; preferably expressed as the substitution or
insertion of a single amino acid.

13. The method of claim 11 wherein the DNA was isolated as
a fragment of genomic DNA from an organism expressing the
carbonyl hydrolase; the fragment preferably consisting
essentially of the structural gene for the hydrolase.

14. The method of claim 11 wherein the hydrolase is a
protein hydrolase or lipase, preferably Bacillus subtilisin,
prosubtilisin, preprosubtilisin, a metalloprotease or other
peptide hydrolase.

15. The method of claim 14 wherein the hydrolase is
Bacillus subtilisin and the mutation is introduced into the
subtilisin at aspartate+32, asparagine+155, tyrosine+104,
methionine+222, glycine+166, histidine+64, serine+221,
glycine+169, glutamate+156, serine+33, phenylalanine+189,
tyrosine+217 and/or alanine+152.

16. The method of claim 14 wherein the hydrolase is
prosubtilisin or preprosubtilisin and the mutation is
introduced into the presubtilisin or preprosubtilisin at
tyrosine-1.

17. The method of claim 11 wherein the mutation is
expressed as a mutant carbonyl hydrolase exhibiting a change
in one or more of the oxidation stability, Km, Kcat, Kcat/Km
ratio, substrate specificity, specific activity or pH
optimum of the hydrolase.

18. The method of claim 14 wherein the hydrolase is a
peptide hydrolase selected from α-aminoacylpeptide
hydrolase, peptidylamino-acid hydrolase, acylamino
hydrolase, serine carboxypeptidase, metallocarboxypeptidase,
thiol proteinase, carboxylproteinase or metalloproteinase.
19. DNA encoding a predetermined mutant of a procaryotic
carbonyl hydrolase; or a vector capable of transforming a
host cell to produce a mutant bacterial carbonyl hydrolase,
which vector comprises such DNA and which, upon
transformation of the host cell, results in expression of
the mutant hydrolase; or a host cell transformed with a said
vector.

20. A mutant carbonyl hydrolase of the kind obtained by
the method of claim 11.

21. A composition comprising the hydrolase of claim 20 in
combination with a detergent, preferably a liquid detergent
or a detergent in granular form; optionally additionally
comprising a builder, bleach or fluorescent whitening agent.

22. The composition of claim 11 wherein the detergent is a
linear alkyl benzene sulfonate, alkyl ethoxylated sulfate,
sulfated linear alcohol or ethoxylated linear alcohol.


23. The method of claim 17 wherein a mutant exhibiting a 
different substrate specificity is recovered. 

24. The method of claim 11 wherein the mutation is within
the enzyme active site.

25. The method of claim 11 wherein the DNA is expressed in
a bacterial host cell, preferably a Bacillus species.

26. The method of claim 11 wherein the mutation is
expressed as the substitution of one or more methionine,
tryptophan, cysteine or lysine residues by a substituent
amino acid residue not one of methionine, tryptophan,
cysteine or lysine, preferably alanine or serine.

27. The method of claim 17 wherein the mutation renders
the mutant enzyme either less oxidation stable or more
oxidation stable than the precursor enzyme.

28. DNA encoding a predetermined enzymatically active
mutant of a precursor procaryotic carbonyl hydrolase, said
mutant exhibiting a different substrate specificity,
oxidation stability and/or pH-activity profile than the
precursor enzyme; or a vector capable of transforming a host
cell to produce a mutant enzyme, which vector comprises such
DNA and which, upon transformation of the host cell, results
in expression of the mutant enzyme; or a host cell
transformed with such a vector.

29. A method comprising culturing a host cell of claim 28
and recovering therefrom the mutant enzyme.
30. A substantially normally sporulating Bacillus which is
incapable of excreting subtilisin or neutral protease.

31. The Bacillus of claim 30, preferably free of Bacillus
strains capable of excreting subtilisin or neutral protease,
and comprising a mutant neutral protease gene, preferably
nonrevertible, which gene contains a deletion that results
in no expression or, upon expression, an enzymatically
inactive polypeptide.

32. The Bacillus of claim 30 which is (a) incapable of
excreting neutral protease and (b) transformed with at least
one DNA moiety encoding a polypeptide not otherwise
expressed by the Bacillus, preferably a subtilisin mutant or
a eucaryotic protein.

33. A vegetative-phase Bacillus culture which is
essentially free of neutral protease, or which is
essentially free of subtilisin.

34. A Bacillus culture free of any gene capable of
expressing enzymatically active neutral protease, or free of
any gene capable of excreting enzymatically active
subtilisin.

35. A vector comprising a deletion mutated neutral
protease gene or a deletion mutated subtilisin gene.
36. A method comprising culturing the Bacillus of claim 30
until a protein not subtilisin or neutral protease,
preferably amylase, has accumulated in the culture, and
recovering the protein.

37. A method comprising:

(a) obtaining a DNA moiety encoding at least a portion
of said precursor protein;

(b) identifying a region within the moiety;

(c) substituting nucleotides for those already
existing within the region in order to create at least one
restriction enzyme site unique to the moiety, whereby unique
restriction sites 5' and 3' to the identified region are
made available such that neither alters the amino acids
coded for by the region as expressed;

(d) synthesizing a plurality of oligonucleotides, the
5' and 3' ends of which each contain sequences capable of
annealing to the restriction enzyme sites introduced in step
(c) and which, when ligated to the moiety, are expressed as
substitutions, deletions and/or insertions of at least one
amino acid in or into said precursor protein;

(e) digesting the moiety of step (c) with restriction
enzymes capable of cleaving the unique sites;

(f) ligating each of the oligonucleotides of step (d)
into the digested moiety of step (e) whereby a plurality of
mutant DNA moieties are obtained;

and optionally the further steps of

(g) expressing each of said moieties as a mutant
protein in a suitable host;

(h) recovering the mutant proteins of step (g); and

(i) screening the step (h) mutant proteins for the
desirable characteristic.

38. The method of claim 37 wherein the restriction enzyme
sites are different.

39. The method of claim 37 wherein the oligonucleotides 
are less than about 50 bp. 
[Panel A: restriction map showing the EcoRI, ClaI, PvuII and BamHI sites and the P, RBS, PRE, PRO, MAT and TERM regions; scale 1.5 kb]



B. 



RBS - 107 



1 uGJCTACTAAAAT AT T ATTCCATACTjUACAATTAATACACAfiAATAATCTGTCTAl TGGTTATTCTGC^TGAAAAAAAG GAGAGG ATAAAGA GTG 

_inn PRE _on 

Arg Gly Lys Lys Val Trp Ile Ser Leu Leu Phe Ala Leu Ala Leu Ile Phe Thr Met Ala Phe Gly Ser Thr Ser
99 AGA GGC AAA AAA GTA TGG ATC AGT TTG CTG TTT GCT TTA GCG TTA ATC TTT ACG ATG GCG TTC GGC AGC ACA TCC 

-tt -70 PRO -60 

Ser Ala Gln Ala Ala Gly Lys Ser Asn Gly Glu Lys Lys Tyr Ile Val Gly Phe Lys Gln Thr Met Ser Thr Met
174 TCT GCC CAG GCG GCA GGfi AAA TCA AAC GGG GAA AAG AAA TAT ATT GTC GG6 TTT AAA CAG ACA ATG AGC ACG ATG 

-Sn -40 
Ser Ala Ala Lys Lys Lys Asp Val Ile Ser Glu Lys Gly Gly Lys Val Gln Lys Gln Phe Lys Tyr Val Asp Ala
?40 AGC OCf. GCT AAG AAG AAA GAT GTC ATT TCT GAA AAA GGC GGG AAA GTG CAA AAG CAA TTC AAA TAT GTA GAC GCA 

-30 -?o -in 

Ala Ser Ala Thr Leu Asn Glu Lys Ala Val Lys Glu Leu Lys Lys Asp Pro Ser Val Ala Tyr Val Glu Glu Asp
3?4 GCT TCA GCT ACA TTA AAC GAA AAA GCT GTA AAA GAA TTG AAA AAA GAC CCG AGC GTC GCT TAC GTT GAA GAA GAT 

-l|-T^ MAT 1" 

His Val Ala His Ala Tyr Ala Gln Ser Val Pro Tyr Gly Val Ser Gln Ile Lys Ala Pro Ala Leu His Ser Gln
3Q0 CAC GTA GCA CAT GCG TAG GCG CAG TCC GTG CCT TAC GGC GTA TCA CAA ATT AAA GCC CCT GCT CTG CAC TCT CAA 

70 30 40 

Gly Tyr Thr Gly Ser Asn Val Lys Val Ala Val Ile Asp Ser Gly Ile Asp Ser Ser His Pro Asp Leu Lys Val
474 GGC TAC ACT GGA TCA AAT GTT AAA GTA GCG GTT ATC GAC AGC GGT ATC GAT TCT TCT CAT CCT GAT TTA AAG GTA 

5(1 Pro Asn fiO Asp 

Ala Gly Gly Ala Ser Met Val Pro Ser Glu Thr Asn Pro Phe Gln Asp Asn Asn Ser His Gly Thr His Val Ala
540 GCA GGC GGA GCC AGC ATG GTT CCT TCT GAA ACA AAT CCT TTC CAA GAC AAC AAC TCT CAC GGA ACT CAC GTT GCC 

70 po ser Ala «o 

Gly Thr Val Ala Ala Leu Asn Asn Ser Ile Gly Val Leu Gly Val Ala Pro Ser Ala Ser Leu Tyr Ala Val Lys
624 GGC ACA GTT GCG GCT CTT AAT AAC TCA ATC GGT GTA TTA GGC GTT GCG CCA AGC GCA TCA CTT TAC GCT GTA AAA 

Asn Ala ion no 
Val Leu Gly Ala Asp Gly Ser Gly Gln Tyr Ser Trp Ile Ile Asn Gly Ile Glu Trp Ala Ile Ala Asn Asn Met
699 GTT CTC GGT GCT GAC GGT TCC GGC CAA TAC AGC TGR ATC ATT AAC GGA ATC GAG TGG GCG ATC GCA AAC AAT ATG 

l?n 130 ho 

Asp Val Ile Asn Met Ser Leu Gly Gly Pro Ser Gly Ser Ala Ala Leu Lys Ala Ala Val Asp Lys Ala Val Ala
774 GAC GTT ATT AAC ATG AGC CTC GGC GGA CCT TCT GGT TCT GCT GCT TTA AAA GCG GCA GTT GAT AAA GCC GTT GCA 

ISO Ser Thr lfiO 

Ser Gly Val Val Val Val Ala Ala Ala Gly Asn Glu Gly Thr Ser Gly Ser Ser Ser Thr Val Gly Tyr Pro Gly
840 TCC GGC GTC GTA GTC GTT GCG GCA GCC GGT AAC GAA GGC ACT TCC GGC AGC TCA AGC ACA GTG GGC TAC CCT GGT 

170 1*1 100 

Lys Tyr Pro Ser Val Ile Ala Val Gly Ala Val Asp Ser Ser Asn Gln Arg Ala Ser Phe Ser Ser Val Gly Pro
9?4 AAA TAC CCT TCT GTC ATT GCA GTA GGC GCT GTT GAC AGC AGC AAC CAA AGA GCA TCT TTC TCA AGC GTA GGA CCT 

?nn ?in 
Glu Leu Asp Val Met Ala Pro Gly Val Ser Ile Gln Ser Thr Leu Pro Gly Asn Lys Tyr Gly Ala Tyr Asn Gly
009 GAG CTT GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA AGC ACG CTT CCT GGA AAC AAA TAC GGG GCG TAC AAC GGT 

2?0 ?y\ ?40 

Thr Ser Met Ala Ser Pro His Val Ala Gly Ala Ala Ala Leu Ile Leu Ser Lys His Pro Asn Trp Thr Asn Thr
1074 ACG TCA ATG GCA TCT CCG CAC GTT GCC GGA GCG GCT GCT TTG ATT CTT TCT AAG CAC CCG AAC TGG ACA AAC ACT 

?50 Gin ?fin 
Gln Val Arg Ser Ser Leu Glu Asn Thr Thr Thr Lys Leu Gly Asp Ser Phe Tyr Tyr Gly Lys Gly Leu Ile Asn
1140 CAA GTC CGC AGC AGT TTA GAA AAC ACC ACT ACA AAA CTT GGT GAT TCT TTC TAC TAT GGA AAA GGG CTG ATC AAC 

?70 ?75 TPBM 

Val Gin Ala Ala Ala Gin oc I trCtVl 

1?74 GTA CAG GCG GCA GCT CAG TAA AAC ATAAAAAACCGGC CTTGGCCCC GCCGGTTTTTTATT A TTTTT CTTCCTCCGCATGTTCAATCCGCTCC 

131 A ATAATCGACGGATGGCTCCCTCTGAAAATTTTAACGAWAACGGCGGGT 

141fi CTTCCCGGTTTCCGGTCAGCTCAATGCCGTAACGGTCGGCGGCGTTTTCCTGATACCGGGAGACGGCATTCGTAATCGGATC 
Fig. 2.

Panel As probe I [^P-AaCaA^ATGGA^Gt] 
I 2 4 5 
Fig. U.



I 2 3 4 



B. amyloliquefaciens- 
Fig!



1 GATATACCTAAATAGAGATAAAATCATCTCAAAAAAATGGGTCTACTAAAATATTATTCCATCTATTACAATAAATTCACAGAATAGTCTTTTAAGTAAG 

-100 

fMet Arg Ser Lys Lys Leu Trp Ile Ser Leu Leu Phe Ala Leu Thr Leu
101 TCTACTCTGAATTTTTTTAAAAGGAGAGGGTAAAGA GTG AGA AGC AAA AAA TTG TGG ATC AGC TTG TTG TTT GCG TTA ACG TTA 

-90 -80 -70 

Ile Phe Thr Met Ala Phe Ser Asn Met Ser Ala Gln Ala Ala Gly Lys Ser Ser Thr Glu Lys Lys Tyr Ile Val
185 ATC TTT ACG ATG GCG TTC AGC AAC ATG TCT GCG CAG GCT GCC GGA AAA AGC AGT ACA GAA AAG AAA TAC ATT GTC 

-60 -50 
Gly Phe Lys Gln Thr Met Ser Ala Met Ser Ser Ala Lys Lys Lys Asp Val Ile Ser Glu Lys Gly Gly Lys Val
260 GGA TTT AAA CAG ACA ATG AGT GCC ATG AGT TCC GCC AAG AAA AAG GAT GTT ATT TCT GAA AAA GGC GGA AAG GTT 



-40 -30 -20 

Gln Lys Gln Phe Lys Tyr Val Asn Ala Ala Ala Ala Thr Leu Asp Glu Lys Ala Val Lys Glu Leu Lys Lys Asp
335 CAA AAG CAA TTT AAG TAT GTT AAC GCG GCC GCA GCA ACA TTG GAT GAA AAA GCT GTA AAA GAA TTG AAA AAA GAT

-10 -1 1 10

Pro Ser Val Ala Tyr Val Glu Glu Asp His Ile Ala His Glu Tyr Ala Gln Ser Val Pro Tyr Gly Ile Ser Gln
410 CCG AGC GTT GCA TAT GTG GAA GAA GAT CAT ATT GCA CAT GAA TAT GCG CAA TCT GTT CCT TAT GGC ATT TCT CAA 

20 30 32 

Ile Lys Ala Pro Ala Leu His Ser Gln Gly Tyr Thr Gly Ser Asn Val Lys Val Ala Val Ile Asp Ser Gly Ile
485 ATT AAA GCG CCG GCT CTT CAC TCT CAA GGC TAC ACA GGC TCT AAC GTA AAA GTA GCT GTT ATC GAC AGC GGA ATT 

40 50 60 

Asp Ser Ser His Pro Asp Leu Asn Val Arg Gly Gly Ala Ser Phe Val Pro Ser Glu Thr Asn Pro Tyr Gln Asp
560 GAC TCT TCT CAT CCT GAC TTA AAC GTC AGA GGC GGA GCA AGC TTC GTA CCT TCT GAA ACA AAC CCA TAC CAG GAC 

64 70 80

Gly Ser Ser His Gly Thr His Val Ala Gly Thr Ile Ala Ala Leu Asn Asn Ser Ile Gly Val Leu Gly Val Ser
635 GGC AGT TCT CAC GGT ACG CAT GTA GCC GGT ACG ATT GCC GCT CTT AAT AAC TCA ATC GGT GTT CTG GGC GTT AGC 

90 100 110 

Pro Ser Ala Ser Leu Tyr Ala Val Lys Val Leu Asp Ser Thr Gly Ser Gly Gln Tyr Ser Trp Ile Ile Asn Gly
710 CCA AGC GCA TCA TTA TAT GCA GTA AAA GTG CTT GAT TCA ACA GGA AGC GGC CAA TAT AGC TGG ATT ATT AAC GGC 

120 130 
Ile Glu Trp Ala Ile Ser Asn Asn Met Asp Val Ile Asn Met Ser Leu Gly Gly Pro Thr Gly Ser Thr Ala Leu
785 ATT GAG TGG GCC ATT TCC AAC AAT ATG GAT GTT ATC AAC ATG AGC CTT GGC GGA CCT ACT GGT TCT ACA GCG CTG 

140 150 160 

Lys Thr Val Val Asp Lys Ala Val Ser Ser Gly Ile Val Val Ala Ala Ala Ala Gly Asn Glu Gly Ser Ser Gly
860 AAA ACA GTC GTT GAC AAA GCC GTT TCC AGC GGT ATC GTC GTT GCT GCC GCA GCC GGA AAC GAA GGT TCA TCC GGA 

170 180 
Ser Thr Ser Thr Val Gly Tyr Pro Ala Lys Tyr Pro Ser Thr Ile Ala Val Gly Ala Val Asn Ser Ser Asn Gln
935 AGC ACA AGC ACA GTC GGC TAC CCT GCA AAA TAT CCT TCT ACT ATT GCA GTA GGT GCG GTA AAC AGC AGC AAC CAA 

190 200 210 

Arg Ala Ser Phe Ser Ser Ala Gly Ser Glu Leu Asp Val Met Ala Pro Gly Val Ser Ile Gln Ser Thr Leu Pro
1010 AGA GCT TCA TTC TCC AGC GCA GGT TCT GAG CTT GAT GTG ATG GCT CCT GGC GTG TCC ATC CAA AGC ACA CTT CCT 

220 221 230 
Gly Gly Thr Tyr Gly Ala Tyr Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala Ala Ala Leu Ile Leu
1085 GGA GGC ACT TAC GGC GCT TAT AAC GGA ACG TCC ATG GCG ACT CCT CAC GTT GCC GGA GCA GCA GCG TTA ATT CTT 

240 250 260 

Ser Lys His Pro Thr Trp Thr Asn Ala Gln Val Arg Asp Arg Leu Glu Ser Thr Ala Thr Tyr Leu Gly Asn Ser
1160 TCT AAG CAC CCG ACT TGG ACA AAC GCG CAA GTC CGT GAT CGT TTA GAA AGC ACT GCA ACA TAT CTT GGA AAC TCT 

270 

Phe Tyr Tyr Gly Lys Gly Leu Ile Asn Val Gln Ala Ala Ala Gln OC
1235 TTC TAC TAT GGA AAA GGG TTA ATC AAC GTA CAA GCA GCT GCA CAA TAA TAGTAAAAAGAAGCAGGTTCCTCCATACCTGCTTC 



1318 TTTTTATTTGTCAGCATCCTGATGTTCCGGCGCATTCTCTTCTTTCTCCGCATGTTGAATCCGTTCCATGATCGACGGATGGCTGCCTCTGAAAATCTTC 
1418 ACAAGCACCGGAGGATCAACCTGCTCAGCCCCGTCACGGCCAAATCCTGAAACGTTTTAACACTGGCTTCTCTGTTCTCTGTC
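
The amino acid labels printed above each row of codons can be checked against the standard genetic code. The Python sketch below is an illustration added here, not part of the original figure; its names are assumptions. It prints a stop codon as "Ter", whereas the figure marks the TAA stop as OC (ochre), and it reads an initiating GTG as Val, whereas the figure labels it fMet.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codons):
    """Translate whitespace-separated DNA codons into three-letter residue names."""
    residues = []
    for codon in codons.split():
        codon = codon.upper()
        if codon not in CODON_TABLE:
            raise ValueError(f"unreadable codon: {codon!r}")
        residues.append(THREE_LETTER[CODON_TABLE[codon]])
    return residues


if __name__ == "__main__":
    # Final codons of the coding sequence shown above.
    print(" ".join(translate("GTA CAA GCA GCT GCA CAA TAA")))
```

A codon that cannot be read, for instance one in which a digit was printed in place of a base, raises an error instead of being silently translated.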